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ABSTRACT 


This  study  investigated  whether  a  combination  of  nineteen 
predictor  variables  could  differentiate  between  nine  training  groups 
of  graduates  from  the  Northern  Alberta  Institute  of  Technology.  The 
purpose  of  the  study  was  to  develop  a  procedure  for  predicting  training 
outcomes  of  students  in  a  technological  institute. 

Discriminant  analysis  was  used  in  determining  the  reduced  space 
in  which  the  separation  of  training  groups  was  maximum.  The  person's 
centour  score  was  used  as  a  basis  of  determining  his  group  membership. 
Various  combinations  of  predictors  and  training  groups  were  analyzed 
and  the  results  of  these  classifications  compared.  The  effectiveness 
of  classification  was  verified  from  a  stratified  random  sample  of  grad¬ 
uates  which  was  set  aside  for  validation.  It  was  also  verified  from  the 
criterion  sample  used  in  the  analysis. 

Classification  of  group  membership  was  determined  by  the  con¬ 
ditional  and  Bayesian  rules  of  probability.  Comparison  of  classifica¬ 
tion  results  were  made  in  terms  of  number  and  corresponding  percentages 
of  correct  classifications,  close  classifications,  fairly  close  class¬ 
ifications,  overall  classifications,  and  misclassif ication. 

Out  of  the  five  discriminant  analyses  carried  out,  analysis  2, 
involving  eight  training  groups  and  nineteen  predictors,  showed  the 
best  result  of  classification.  The  discriminant  weights  obtained  in  this 
analysis  were  applied  on  a  sample  of  non-graduates  to  verify  the 
classification  procedure  employing  a  different  group  of  students.  The 
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result  of  the  classification  showed  a  reversed  trend.  This  was  another 


indication  that  the  prediction  scheme  developed  in  this  study  could  be 
a  useful  tool  for  counseling. 

Reducing  the  number  of  training  groups  by  combining  two  of  them 
showed  an  improvement  in  correct  classification,  but  not  always  in 
accuracy.  When  the  analysis  was  limited  to  highly  weighted  predictors, 
correct  classifications  were  found  to  be  lower  than  when  19  predictors 
were  used.  Twelve  predictors,  which  were  readily  available  to  counselors 
at  NAIT,  could  probably  have  served  the  purpose  of  counseling  although 
with  a  lesser  degree  of  effectiveness. 

Psychological  meanings  were  derived  from  the  distribution  of 

<• 

scores  of  the  graduate  sample  in  a  3 -dimensional  discriminant  space. 

The  first  function  was  interpreted  as  measuring  a  mechanical  attribute, 
the  second  as  an  intellectual  attribute,  and  the  third  as  a  verbal- 
enterprising  attribute. 

The  analysis  carried  out  to  determine  whether  19  predictors 
could  differentiate  between  two  outcome  categories,  showed  that  it 
was  not  possible  to  differentiate  between  graduates  and  non-graduates. 

One  reason  advanced  was  that  this  study  used  a  restricted  group. 

The  study  demonstrated  that  19  predictor  variables  could  be 
analyzed  to  develop  a  workable  prediction  scheme  for  purposes  of 
counseling  in  a  technological  institute. 
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CHAPTER  I 


INTRODUCTION 

Super  (1954)  viewed  vocational  counseling  in  the  United  States 
and  Canada  as  "human  development"  while  viewing  it  as  "manpower  utiliz¬ 
ation"  in  countries  with  disturbed  economies.  In  the  latter  case  its 
purpose  is  primarily  a  means  of  obtaining  the  needed  supply  and  distribu¬ 
tion  of  trained  manpower.  Human  development  on  the  other  hand  implies 
the  development  of  the  individual  to  the  full  maximum  of  his  potential¬ 
ities  and  to  his  own  satisfaction.  In  the  North  American  cultural 
framework  vocational  counseling  is  more  extensive  in  scope  than  else¬ 
where.  It  places  more  demands  on  the  counselor  to  apply  various  methods 
and  procedures  in  assisting  the  student  to  attain  that  full  measure  of 
human  development  and  satisfaction. 

When  a  student  comes  to  him  for  advice  with  regard  to  choosing 
an  occupation  the  counselor  studies  the  characteristics  of  the  individ¬ 
ual  and  looks  over  occupations  in  which  others  with  characteristics  like 
his  were  successful  and  satisfied.  To  secure  this  information  which 
consists  of  antecedent-consequent  relationships  the  counselor  uses 
both  the  statistical  and  clinical  approaches.  As  used  here,  statistical 
approach  refers  to  test  interpretation  and  prediction.  The  clinical 
approach  refers  to  inferential,  subjective,  and  nonquantitative  pre¬ 
diction.  The  clinical  approach  will  not  be  treated  in  this  dissertation. 

One  of  the  most  common  statistical  bridges  the  guidance  counselor 
uses  in  his  work  is  the  test  norm.  This  procedure  involves  a  direct 
comparison  of  the  individual's  raw  test  score  or  converted  score  with 
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some  table  of  norms.  It  is  based  on  the  assumption  that  the  norm  group 
represents  a  meaningful  base  for  comparison  and  prediction.  Goldman 
(1961)  observed  that  the  greatest  danger  in  the  use  of  norms  is  that 
more  will  be  assumed  for  a  normative  statement  than  is  warranted.  Another 
danger  is  the  unwarranted  assumption  that  the  higher  the  score,  in  compar¬ 
ison  with  the  norm  group,  the  better.  In  spite  of  their  limitations,  test 
norms  have  some  practical  uses  in  vocational  counseling. 

For  many  years  the  mainstay  of  test  interpretation  and  prediction 
in  vocational  counseling  has  been  the  regression  approach.  In  this  pro¬ 
cedure  prediction  equations  are  computed,  based  on  the  linear  relation¬ 
ship  between  test  scores  and  some  criteria  of  success  or  satisfaction. 

The  multiple  regression  technique  is  used  to  select  the  most  effective 
combination  of  tests,  determine  their  respective  weights  and  assess 
the  effectiveness  of  the  composite  predictor.  By  substituting  a  person's 
test  scores  in  the  multiple  regression  formula,  his  predicted  criterion 
score  can  be  determined  and  made  use  of  in  vocational  counseling. 

However,  there  are  some  difficulties  with  the  use  of  the  multiple 
regression  approach  in  test  interpretations  and  prediction.  Rulon  (1951) 
pointed  out  that  it  is  almost  always  assumed  in  this  approach  that  the 
person  in  question  is  a  member  of  the  group.  This  is  so  because  the 
regression  equation  is  developed  on  the  basis  of  persons  who  have  already 
entered  a  particular  group.  It  cannot  be  denied  that  there  are  many 
selection  factors  involved  before  the  individual  becomes  a  member  of 
the  group.  Cooley  and  Lohnes  (1962)  cited  an  example  of  chemistry 
majors  who  might  all  tend  to  be  high  on  mathematics  aptitude.  By  the 
multiple  regression  procedure  the  importance  of  mathematics  aptitude 
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would  not  be  reflected  in  predicting  performance  as  a  chemistry  major. 
This  might  result  in  misclassif ication.  With  regression  information 
the  individual  is  more  likely  to  appear  to  have  the  highest  aptitude 
for  the  less  difficult  task  hence  the  tendency  for  such  an  individual 
to  enter  or  be  assigned  to  less  difficult  tasks.  Although  the  results 
of  the  regression  calculation  indicate  relationships  between  predictors 
and  criteria  for  individuals  within  groups  it  is  lacking  with  regards 
to  relationships  between  groups.  In  his  work  with  classification  and 
selection  for  Air  Force  personnel,  Tiedeman  (1951)  pointed  out  a  further 
difficulty  in  using  regression  analysis.  The  problem  is  in  making 
criterion  scales  comparable  from  one  job  to  the  next. 

In  spite  of  these  limitations,  correlation  and  regression  an¬ 
alysis  do  give  useful  information  and  have  contributed  to  a  large 
extent  in  the  development  of  test  interpretation  and  prediction  and 
in  the  advancement  of  psychological  knowledge.  The  approach  is  useful 
in  predicting  the  group  in  which  an  individual  will  be  most  successful. 
It  is  also  useful  when  individuals  in  a  group  are  ranked  according  to 
their  predicted  ability  to  perform  a  particular  job  in  order  that  a 
certain  number  may  be  selected  from  the  top. 

However,  in  personnel  classification  and  vocational  counseling 
one  is  faced  with  the  question  "To  which  group  does  an  individual  bear 
the  greatest  resemblance?",  before  asking  the  question,  "How  well  will 
the  individual  perform  if  assigned  to  the  group  he  most  resembles?". 
Surely,  the  answer  to  the  first  question  would  provide  vocational 
counseling  with  an  effective  means  of  uncovering  promising  careers  for 
the  individual  and  assisting  him  to  make  an  appropriate  choice.  This 
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may  enable  the  individual  to  set  his  goal  toward  the  most  rewarding 
vocation  for  him  as  well  as  to  attain  the  purposes  of  vocational  coun¬ 
seling  which  Super  termed  as  "human  development".  Yet  while  the  first 
question  is  basic  in  helping  an  individual  make  proper  choices,  the 
use  of  an  appropriate  statistical  technique  such  as  discriminant 
analysis  is  rarely  if  ever  used  in  school  guidance.  The  reason  for  this 
situation  is,  perhaps,  the  need  for  well-informed  personnel  to  carry 
out  possible  research  applications.  Another  reason  is  that  the  determin¬ 
ation  of  the  function  requires  voluminous  calculations  when  the  analysis 
involves  three  or  more  variables  and  groups.  However,  with  the  aid  of 
the  medium  speed  electronic  computer,  which  is  becoming  available  in 
many  school  systems,  the  above  reasons  may  now  be  overcome. 

Multiple  discriminant  analysis  originated  with  Fisher.  It  is 
widely  used  in  taxonomic  scientific  investigations  such  as  dating  a 
series  of  Egyptian  skulls,  distinguishing  between  two  forms  of  black 
locust  trees,  and  classifying  individuals  into  three  Indian  castes. 

These  earlier  investigations  mentioned  above  were  carried  out  by  M. 
Barnard,  Fisher,  and  Rao  respectively,  (Tiedeman,  1951).  Since  then 
this  technique  has  been  used  in  many  scientific  classification  and 
selection  problems.  Its  application  to  psychology  is  relatively 
recent  and  not  widely  developed.  Current  basic  statistical  textbooks 
in  psychology  and  education  rarely  provide  a  complete  explanation  re¬ 
garding  methods  of  calculating  the  discriminant  function  as  opposed  to 
the  widely  known  multiple  regression  analysis. 

The  early  investigators  who  conducted  studies  using  discriminant 
analysis  most  often  concerned  themselves  with  demonstrating  its  effect¬ 
iveness  and  discriminating  power  in  psychological  classification  problems. 
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Sufficient  evidence  has  now  accumulated  indicating  the  usefulness  of 
the  multiple  discriminant  function  as  a  statistical  tool  for  certain 
psychological  investigations.  The  recent  direction  of  studies  in  this 
regard  has  shifted  to  investigation  of  predictor  variables  and  group 
categories  that  may  be  useful  in  identifying  special  talents,  in  de¬ 
termining  vocational  choices,  and  in  classification  and  selection  of 
personnel.  However,  by  and  large  the  movement  along  the  continuum  from 
statistical  and  psychological  knowledge  to  educational  practice  has  been 
a  slow  one.  Further  studies  are  still  needed  to  identify  effective 
predictor  variables  appropriate  for  particular  situations  such  as  those 
in  vocational  guidance  and  counseling,  student  selection,  and  manpower 


training  and  utilization. 
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CHAPTER  II 


THE  PROBLEM 

Each  year  an  increasingly  large  number  of  students  of  ages  16 
years  or  over,  who  had  completed  the  eleventh  and  twelfth  grades  in 
high  schools  in  the  province  of  Alberta  and  other  Canadian  provinces, 
applied  for  enrollment  at  the  Northern  Alberta  Institute  of  Technology 
to  pursue  various  technical,  business,  and  vocational  courses.  These 
applicants  were  admitted  primarily  on  the  basis  of  grade  level  completed, 
marks  earned,  and  priority  of  application. 

According  to  the  1966-67  school  calendar^  the  institute  offered 
an  articulated  technology  program  in  three  patterns  of  studies.  The 
three-year  pattern  of  studies  consisted  of  Year  A,  B,  and  C.  Entrance 
requirements  for  admission  into  Year  A  of  this  pattern  were:  67 
Alberta  high  school  credits  (more  or  less  a  completion  of  Grade  XI)  with 
at  least  a  "BM  standing  in  Mathematics  20  or  22,  Science  20  or  22, 
and  English  20.  However,  preference  was  given  to  applicants  having 
additional  Grade  XII  credits  particularly  in  Mathematics  30  or  32.  Year 
A  was  a  pre-technology  program  consisting  of  Grade  XII  mathematics, 
physics,  English,  plus  the  applicable  vocational  subject.  This  program 
was,  in  some  measure,  a  making  up  of  the  deficiency  of  the  student  for 
not  completing  Grade  XII  and  earning  an  Alberta  high  school  diploma. 


There  have  been  some  changes  in  the  school  calendar  of  the  institute 
since  then.  However,  these  changes,  up  to  the  date  of  this  writing, 
did  not  significantly  affect  the  discussion  above. 
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One  of  the  two-year  patterns  of  study  consisted  of  Years  B  and 

C.  The  entrance  requirements  for  admission  to  Year  B  of  this  pattern 

were:  an  Alberta  high  school  diploma  (i.e.  a  completion  of  Grade  XII) 

with  at  least  a  "B"  standing  in  Mathematics  30  or  33,  thirty-five  or 

more  credits  in  one  of  the  articulated  vocational  high  school  subjects 

with  at  least  a  "BM  standing  in  the  final  year  and  credit  in  Physics 

30  or  32,  or  a  successful  completion  of  Year  A  at  the  institute  in 

2 

Edmonton  or  Calgary  . 

Another  two-year  pattern  consisted  of  Years  AB  and  C.  This 
pattern  accommodated  students  coming  from  the  high  school  academic 
program.  The  entrance  requirements  for  admission  to  Year  AB  in  this 
pattern  were:  an  Alberta  high  school  diploma  with  at  least  a  "B" 
standing  in  Mathematics  30  or  32,  Physics  30  or  32,  a  credit  in  English 
30  or  33  and  a  minimum  overall  high  school  average  of  55%.  Year  AB 
was  given  on  an  extended  term  of  10  months  with  33  hours  per  week  of 
instruction.  It  was  felt  that  this  extension  of  study  time  was  suf¬ 
ficient  to  cover  Years  A  and  B  considering  that  those  who  were  admitted 
into  this  pattern  had  high  academic  standing. 

The  articulated  technology  courses  offered  under  these  patterns 
were:  air  conditioning  and  refrigeration,  electrical,  architectural, 

drafting,  electronics,  exploration,  instrumentation,  industrial  production, 
and  telecommunication  technologies.  Figure  1  below  showed  the  various 
routes  under  this  program. 


The  province  of  Alberta  operates  two  institutes  of  technology,  the 
Northern  Alberta  Institute  of  Technology  in  Edmonton  (NAIT)  and  the 
Southern  Alberta  Institute  of  Technology  in  Calgary  (SAIT) . 
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High  School 
Vocational  Program 


High  School 
Academic  Program 


Fig.  1 

Diagram  showing  the  three  patterns  of 
studies  of  the  articulated  technology 

program 

The  institute  also  offered  a  non-articulated  technology  program. 
These  were  technologies  with  specific  entrance  requirements  and  length 
of  training.  The  non-articulated  technologies  which  normally  required 
two  years  to  complete  were:  chemical,  civil,  surveying,  gas,  materials, 
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plastics,  dental  laboratory,  forest,  heavy  duty  equipment,  photographic, 
medical  laboratory,  and  medical  X-ray  technologies.  The  entrance  require¬ 
ments  in  these  non- articulated  technologies  were:  a  high  school  diploma 
and  a  "B"  standing  in  a  high  school  subject  or  subjects  considered 
most  requisite  for  a  particular  technology.  For  example,  in  chemical 
technology  the  applicant  must  be  a  high  school  graduate  with  at  least 
a  "B"  in  chemistry  and  a  credit  in  Mathematics  30  or  32.  The  medical 
laboratory  and  medical  X-ray  technologies  had  special  requirements. 
Applicants  to  these  two  technologies  must  have  Alberta  (senior)  matric¬ 
ulation.  Preference  was  given  to  the  student  with  credits  in  English 
30,  Social  Studies  30,  Mathematics  30,  Language  30,  and  two  of  Chemistry 
30,  Physics  30  or  Biology. 

The  non-ar ticulated  technologies  which  normally  required  one  year 
to  complete  were:  radio  and  TV  and  dental  assisting.  Applicants 
to  these  technologies  must  have  obtained  67  high  school  credits 
and  a  "B"  standing  in  Mathematics  20  or  22,  Science  20  or  22,  and 
English  20. 

The  business  and  vocational  courses  which  normally  required  two 
years  to  complete  were  business  administration,  commercial  cooking, 
dietary  service  technician,  distributive  technology,  electronic  data 
processing,  and  secretarial  technology.  Courses  normally  requiring  one 
year  for  completion  were  banking  program  and  office  machine  mechanics. 

The  minimum  requirement  in  the  banking  program,  business  administration, 
distributive  technology,  and  secretarial  technology  was  a  high  school 
diploma.  Preference  was  given  to  applicants  who  had  50%  in  Grade  XII 
English  and  Grade  XI  mathematics.  Commercial  cooking  required  35 
Alberta  high  school  credits.  Applicants  to  the  dietary  service  technician 
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course  must  have  completed  Grade  XI  with  at  least  50%  grade  in  Grade  XI 
mathematics,  science,  and  English.  In  the  electronic  data  processing 
course  the  applicant  must  have  had  a  high  school  diploma  with  a  "B"  in 
mathematics  30  or  32  and  Grade  XII  English.  Applicants  for  the  office 
machine  mechanics  course  were  required  to  complete  Grade  XI  only. 

To  be  considered  for  admission  to  the  institute  the  applicant 
must  file  an  application  form  stating  such  information  as  the  course 
desired,  age,  height,  weight,  disabilities,  church  affiliation,  marital 
status,  courses  previously  taken  at  the  institute  if  any,  high  school 
subjects  completed  and  corresponding  marks  in  Grades  X,  XI,  and  XII,  and 
related  work  experiences.  As  a  check  an  official  transcript  of  all 
high  school  marks  was  attached  to  the  application  as  soon  as  this  became 
available. 

The  selection  of  students  who  were  finally  accepted  for  enrollment 
was  decided  on  the  basis  of  the  prescribed  requirements  in  the  school 
calendar.  If  there  were  any  doubts  as  to  the  capabilities  of  the  student 
or  his  success  in  a  certain  course  the  student  was  referred  for  inter¬ 
view  with  the  head  of  the  department  concerned  and/or  a  guidance  counselor 
and  their  recommendations  were  combined  with  the  other  requirements  in 
considering  the  final  admission  of  the  student. 

As  may  be  gleaned  from  this  procedure  of  student  selection  reviewed 
above,  the  course  choice  of  the  applicant  could  be  attributed  to  op¬ 
portuneness,  that  is,  both  his  personal  wish  and  the  institution's  will 
were  met.  It  could  be  attributed  in  some  degree  to  the  results  of  the 
interview  with  the  department  head  and/or  a  guidance  counselor  and 
perhaps  it  could  be  attributed  in  some  degree  to  the  vocational  and 
academic  experiences  of  the  student  while  in  high  school,  to  his  work 
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experiences  outside  the  school,  and  to  his  maturing  awareness  of  his 
abilities  and  aptitudes. 

Considering  the  host  of  technologies  and  courses  which  a  student 
could  choose  from,  one  of  the  most  important  responsibilities  of  the 
department  heads  and  guidance  counselors  who  assisted  the  applicant  in 
making  an  appropriate  choice  was  to  judge  the  courses  for  which  the 
student  had  an  aptitude.  To  discharge  this  responsibility  required  some 
basis  of  comparing  the  characteristics  of  the  student  with  groups  to 
which  he  might  probably  best  belong.  The  entrance  requirements  pre¬ 
scribed  by  the  institute  indicated  that  some  assumptions  were  made 
concerning  a  number  of  abilities  of  the  individual  and  the  different  de¬ 
grees  and  combination  which  were  required  for  success  and  satisfaction 
in  the  various  technologies  and  courses.  If  a  number  of  characteristics 
of  the  student  were  observed  at  the  early  part  of  admission  or  before 
admission  to  his  course  and  then  set  aside  until  after  his  graduation, 
one  might  be  able  to  study  how  this  information  could  be  combined  in 
making  a  recommendation  of  enrollment  to  a  certain  course  for  incoming 
students  to  the  institute.  Such  study  would  give  substantial  assist¬ 
ance  to  the  applicant  who  was  making  the  choice  as  well  as  minimize 
misclassif ication  of  placement  on  the  part  of  the  institute. 

During  the  admission  period  the  basic  problem  in  helping  the 
student  applicant  make  an  appropriate  choice  was  to  provide  a  probable 
answer  to  the  question,  "Which  group  did  he  resemble  most?",  before  one 
answered  the  question  "How  well  will  the  individual  performed  if  as¬ 
signed  to  the  group  he  most  resembled?".  In  other  words  it  was  knowing 
whether  the  applicant  most  resembled  the  graduates  of  either  electronics 
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or  drafting  or  business  administration  or  some  other  course.  To 
provide  better  assistance  to  students  studying  at  the  Institute,  with 
regard  to  choice  of  courses,  it  was  felt  that  there  was  a  need  to  develop 
some  means  of  answering  the  first  question  above. 

No  doubt  the  many  problems  like  the  one  cited  above  had  been  in 
the  minds  of  the  guidance  and  counseling  personnel  of  the  institute  for 
some  time.  During  the  early  months  of  the  1966-67  school  term  they 
administered  the  Differential  Aptitude  Tests  and  the  Lorge-Thorndike 
Verbal  and  Non-verbal  Intelligence  Tests  to  all  students  enrolled  for 
the  first  time  in  the  various  technologies,  business,  and  vocational 
courses  for  purposes  of  securing  information  about  students  that  might 
be  helpful  in  vocational  counseling.  Raw  scores  of  these  tests  were 
converted  into  percentile  scores  and  intelligence  quotients  respectively 
using  the  tables  of  norms  provided  in  the  test  manuals.  To  a  certain 
extent  these  results  gave  some  information  useful  in  vocational  counsel¬ 
ing  at  the  institute.  However,  as  stated  earlier,  norms  do  not  go  very 
far  for  purposes  of  prediction  especially  if  the  sample  was  not  com¬ 
parable  with  the  norming  groups. 

Studies  conducted  by  Bennett,  Seashore,  and  Wesman  (1959)  on  the 
predictive  qualities  of  the  Differential  Aptitude  Test  revealed  important 
profile  differences  among  high  school  students  who  entered  diverse 
occupational  and  educational  careers.  A  longitudinal  study  by  Seashore 
(1959)  showed  that  the  level  of  the  career  endeavor  of  students  was  re¬ 
lated  to  their  level  of  aptitudes  and  that  the  job  level  within  an  oc¬ 
cupational  area  was  related  to  the  scores  on  the  most  relevant  sub-tests. 
According  to  the  authors  the  Differential  Aptitude  Tests  were  not  developed 
to  isolate  simple,  pure  abilities  but  rather  to  measure  complex  abilities 
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in  relation  to  job,  families  and  curricula.  However,  a  study  by 
Guilford,  Hoepner,  &  Petersen  (1965)  indicated  that  the  combination  of 
four  DAT  subtests  did  about  as  well  as  factored  battery  tests  among 
several  standard  tests  compared  in  their  study.  The  Lorge-Thorndike 

Intelligence  Tests  were  measures  of  abstract  intelligence  expressed  in 

/ 

verbal  symbols  on  one  part  and  in  pictorial,  diagramatic,  and  numerical 
symbols  on  the  other.  Studies  by  Lorge  &  Thorndike  (1954)  had  shown 
high  correlations  of  these  tests  with  educational  achievement  tests  and 
school  marks.  Not  many  studies  had  been  done  with  these  tests  on  pre¬ 
diction  over  a  period  of  time.  Group  mental  tests,  according  to  Cronbach 
(1960),  predicted  success  on  a  great  many  jobs  although  in  some  jobs, 
special  abilities  were  much  more  important  than  general  ability.  A  re¬ 
view  of  literature  on  prediction  of  success  of  workers  made  by  Ghiselli 
in  1955  substantiated  this  observation  and  indicated  the  importance  of 
group  mental  tests  in  vocational  guidance.  With  certain  types  of  students 
and  a  school  setting  such  as  the  one  at  Northern  Alberta  Institute  of 
Technology  these  tests  mentioned  above  had  not  been  studied  to  determine 
the  prediction  of  membership  of  individuals  among  different  groups. 

By  the  end  of  the  1967-1968  school  term,  two  years  later,  the 
students  who  took  those  tests  had  graduated  from  the  various  technologies, 
business,  and  vocational  courses.  Those  who  started  in  Year  A  had  either 
successfully  completed  Year  B  of  their  respective  courses  or  had  failed 
some  of  their  subjects.  Others  had  either  withdrawn  from  the  institute 
before  they  were  able  to  complete  their  courses,  transferred  to  another 
course  and  weife  continuing  their  studies  in  the  institute,  or  did  not 
graduate  due  to  failure  in  some  subjects. 
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Each  year  there  were  many  applicants  coming  from  the  various 
high  school  programs  who  elected  to  pursue  certain  courses,  who  were 
admitted  to  the  institute  on  the  basis  of  grade  level  and  marks  earned 
on  certain  subjects,  and  who  after  the  end  of  one  or  two  years  appeared 
in  various  training  categories.  However,  very  little  information  was 
provided  to  the  entering  student  about  how  his  aptitudes  and  high  school 
achievements  compared  with  those  students  who  graduated  or  made  satis¬ 
factory  progress  in  the  various  courses. 

The  problem  of  this  study  was,  therefore,  to  develop  a  procedure 
for  interpreting  the  scores  on  the  Lorge-Thorndike  Intelligence  Test, 
the  Differential  Aptitude  Tests,  the  Grade  Nine  Alberta  Departmental 
Examinations,  School  and  College  Ability  Tests  Level  3,  and  the  high 
school  averages  for  predicting  the  training  outcomes  of  graduates  of 
the  Northern  Alberta  Institute  of  Technology. 

So  far,  a  review  of  the  literature  in  this  area  showed  that  no 
study  had  been  made  on  the  variables  stated  above  in  predicting  training 
outcomes  of  students  in  vocational  and  technical  institutes.  Terminal 
schools  like  the  Northern  Alberta  Institute  of  Technology  were  a  special 
type  of  institution  which  provided  technical,  business,  and  vocational 
training  to  a  certain  type  of  student  who  later  make  up  the  considerable 
bulk  of  the  main  work  force  of  the  nation  and  who  no  longer  enter  the 
higher  institutions  of  learning.  Very  little  research  information  was 
known  about  this  type  of  student. 

Theoretical  framework 

Tyler  (1959)  in  her  theoretical  analysis  toward  a  workable 
psychology  of  individuality  pointed  out  that  the  emphasis  on  the 


15 


traditional  measurement  approach  based  on  dimensions  or  trait  continua 
might  not  likely  resolve  the  prediction  of  human  behavior.  Some  evidence 
had  accumulated  which  indicated  the  need  for  the  use  of  a  kind  of  measure¬ 
ment  which  would  characterize  the  person’s  customary  pattern  of  choice 
and  organization.  One  impressive  piece  of  evidence  which  supported  this 
contention  was  the  20-year  follow-up  study  of  the  Strong  Vocational 
Interest  Blank  (1955)  based  on  letter  grades  as  predictors  of  a  special 
kind  of  criterion  -  that  of  remaining  in  the  original  occupation  versus 
shifting  to  another.  These  letter  grades  carried  different  meaning  from 
customary  trait  measurement.  An  "A"  would  signify  that  the  person’s 
customary  pattern  of  choice  and  organization  was  like  the  choice  pat¬ 
tern  characteristics  of  persons  in  a  certain  occupation. 

Cooley  (1964)  in  a  similar  vein  explained  that  the  basic  proposi¬ 
tion  was  that  different  plans  were  appropriate  for  different  people.  The 
problem,  was  to  develop  a  method  for  dealing  with  the  relationships  be¬ 
tween  types  of  plans  and  kinds  of  people. 

From  factor  theory  it  was  possible  to  deal  with  the  taxonomic 
problem  of  plans  and  people.  In  conceptualizing  human  behavior  in 
factor  theory,  personality  had  its  locus  in  an  m-dimesnional  space. 

The  location  of  the  individual's  personality  in  this  space  was  determined 
by  the  total  pattern  of  the  m-behavioral  measures  which  were  available 
for  that  individual.  Personality,  as  used  here,  referred  to  all  be¬ 
havior  and  included  intellectual  functioning  as  conceptualized  by 
Guilford  (1959).  Individuals  with  similar  characteristics  tended  to 
occupy  similar  regions  in  this  m-dimensional  space.  This  meant  that 
individuals  with  similar  personalities  would  have  a  tendency  to  make  a 
similar  occupational  choice.  In  this  study  occupational  choice  was 
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based  on  personality  and  not  on  things  like  occupation  of  the  father, 
economic  status,  religion,  etc.  If  we  were  able  to  determine  the  re¬ 
gions  of  the  groups  of  individuals  who  had  made  particular  choices, 
providing  that  these  regions  were  distinct  with  respect  to  these  personal 
characteristics  and  that  the  individuals  were  drawn  from  the  population 
for  wtom  each  region  was  constructed,  it  would  be  possible  to  estimate 
the  probability  of  the  choice  of  another  person  with  a  similar  pattern 
of  personality. 

Some  studies  suggested  that  different  levels  of  aptitude  were 
required  of  different  educational  and  occupational  levels.  Some  studies 
suggested  that  individuals  developed  a  certain  awareness  of  these  dif¬ 
fering  degrees  of  abilities  and  aptitudes  at  some  age  level  approximately 
during  their  senior  or  high  school  graduation.  Also,  one's  choice  of 
education  and  occupation  was  influenced  to  a  certain  extent  by  this 
maturing  awareness  of  abilities  and  aptitudes.  Taking  cognizance  of 
these  findings,  this  study  postulated  that  the  scores  made  by  individuals 
on  the  Lorge-Thorndike  Intelligence  Tests,  the  Differential  Aptitude 
Tests,  the  Alberta  Grade  Nine  Departmental  Examinations,  the  School 
and  College  Ability  Tests  Level  3,  and  the  high  school  averages 
could  be  used  in  determining  different  regions  in  an  m-dimensional  test 
space  which  certain  groups  tended  to  occupy.  It  further  postulated  that 
certain  regions  defined  in  the  m-dimensional  space  were  distinct  with 
respect  to  these  characteristics  and  that  a  person's  likelihood  of 
membership  in  a  certain  occupational  group  could  be  predicted  from  his 
pattern  of  scores  on  these  tests  and  high  school  averages. 
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Null  hypotheses 

This  study  tested  or  investigated  the  following  null  hypotheses: 

1.  There  are  no  significant  differences  among  group  means  in  the 
sub-scores  of  the  Lorge-Thorndike  Intelligence  Tests,  the  Differential 
Aptitude  Tests,  the  Alberta  Grade  Nine  Departmental  Examinations,  SCAT 
Level  3,  and  the  high  school  averages  of  the  groups  of  students  class¬ 
ified  as  graduates  and  non-graduates  on  an  ji  priori  basis  according  to 
training  outcomes. 

2.  The  characteristics  of  the  various  training  groups  as  measured 
by  the  nineteen  predictor  variables  consisting  of  the  sub-scores  of  the 
Lorge-Thorndike  Intelligence  Tests,  the  Differential  Aptitude  Tests,  the 
Alberta  Grade  Nine  Departmental  Examinations,  SCAT  Level  3,  and  the 

high  school  averages  are  not  different. 

3.  The  dispersions  of  groups  along  the  discriminant  vectors  as 
defined  by  the  sub-scores  of  the  Lorge-Thorndike  Intelligence  Tests, 
the  Differential  Aptitude  Tests,  the  Alberta  Grade  Nine  Departmental 
Examinations,  SCAT  Level  3,  and  the  high  school  averages  are  not 
significant. 

4.  The  sub-scores  of  these  tests  stated  above  and  the  high  school 
averages  do  not  predict  probable  membership  of  students  to  certain  train¬ 
ing  groups  as  classified  on  an  a.  priori  basis  according  to  training 
outcomes . 

5.  There  are  no  psychological  meanings  or  reasons  that  might  be 
derived  for  group  positions  in  the  discriminant  space  based  on  the  sub¬ 
scores  of  these  tests  stated  above  and  the  high  school  averages. 


CHAPTER  III 


REVIEW  OF  LITERATURE 

The  main  problem  of  this  study  was  to  determine  the  membership 
of  individuals  in  training  outcome  groups  on  the  basis  of  a  combination 
of  predictor  variables.  Discriminant  analysis  had  been  found  most  ap¬ 
propriate  for  such  a  problem.  Travers  (1939)  demonstrated  its  use  in  dis¬ 
tinguishing  successful  engineers  from  successful  air  pilots.  He  secured 
twenty  samples  from  each  of  the  two  groups  and  used  six  test  scores  of 
intelligence,  form  relations,  dynamometer,  dotting,  sensory-motor  coordin¬ 
ation,  and  perserveration.  However,  the  purpose  of  his  study  was  merely 
to  show  the  possibilities  of  the  discriminant  analysis  in  psychological 
investigations.  Garret  (1943)  used  the  same  data  as  Traverd  but  utilized 
the  scores  of  three  tests  only.  His  purpose  was  also  similar  to  Travers’. 
His  interest  was  in  explaining  and  showing  the  calculation  of  the  dis¬ 
criminant  function  in  such  a  manner  that  psychologists  who  were  not  so 
mathematically  oriented  may  be  able  to  make  use  of  it. 

One  of  the  early  psychological  investigations  on  determining 
group  membership  using  the  discriminant  function  was  done  by  Baggaley 
(1947)  of  Harvard  University.  He  wanted  to  study  the  relation  be¬ 
tween  the  scores  obtained  by  180  Harvard  freshmen  on  the  Kuder  Prefer¬ 
ence  Record  and  their  fields  of  concentration.  He  classified  the  22 
fields  of  concentration  into  two  groups:  Group  A  consisted  of  those  who 
chose  the  natural  science  area  and  Group  B  consisted  of  those  who  chose 
the  social  sciences  and  humanities.  His  findings  showed  that  the  Kuder 
scores  provided  a  basis  for  differentiating  among  Harvard  students  who 
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proposed  to  concentrate  in  different  academic  fields.  However,  the  uses 
of  the  findings  for  guidance  was  rather  limited  to  the  classification 
of  two  broad  categories  of  academic  training  outcomes  since  Groups  A 
and  B  lumped  together  several  fields  of  concentration. 

It  was  only  in  1950  that  a  method  was  developed  to  use  discrimi¬ 
nant  analysis  with  more  than  two  groups.  Working  independently,  Bryan 
(1950)  at  Harvard  developed  a  method  for  the  exact  determination  of  the 
characteristic  equation  and  latent  vectors  of  a  matrix  with  application 
to  the  discriminant  function  for  more  than  two  groups.  At  about  the 
same  time  Rao  (1952)  in  India  also  independently  generalized  the  method 
to  investigate  differences  among  the  multivariate  distribution  on  any 
number  of  groups  in  problems  of  biological  classifications.  Using  the 
scores  of  the  Kuder  Preference  Record  of  freshmen  at  Harvard,  Bryan  demon¬ 
strated  that  his  method  could  be  used  on  a  large  scale  of  predictor  var¬ 
iables  and  reduced  the  numerical  process  of  calculating  the  discriminant 
roots  and  latent  vectors. 

Tiedeman  and  Steinberg  (1951)  investigated  the  difference  between 
the  regression  analysis  and  discriminant  analysis  in  a  classification 
problem  and  showed  the  possibility  of  misclassif ication  in  the  former. 

The  data  used  were  the  sub-scores  of  the  Differential  Aptitude  Tests. 

The  tests  were  administered  to  207  beginning  ninth  grade  students  in 
Waltham,  Massachusetts  who  were  allowed  to  choose  from  the  various 
secondary  curricula.  At  the  end  of  their  tenth  grade  most  of  the  students 
had  chosen  either  the  college  preparatory  or  the  business  curricula.  By 
regression  analysis  using  the  tenth  grade  average  as  criterion,  the  re¬ 
sults  showed  that  if  Waltham  students  had  chosen  the  curriculum  in  which 
they  were  more  likely  to  receive  their  highest  mark,  the  college 
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preparatory  curriculum  would  be  the  choice  of  all  the  students.  However, 
when  the  grade  average  criterion  scores  were  transformed  to  standard 
scores  the  two  regression  equations  when  recomputed  showed  that  the  bus¬ 
iness  curriculum  regression  was  higher  than  the  college  preparatory 
curriculum  regression.  This  result  showed  that  choosing  the  curriculum 
in  which  Waltham  students  were  more  likely  to  receive  the  highest  mark 
did  not  imply  appropriate  choice  of  curriculum  for  each  student.  How¬ 
ever,  in  a  discriminant  analysis,  the  Differential  Aptitude  Tests  clearly 
distinguished  between  students  who  chose  either  the  college  preparatory 
curriculum  or  the  business  curriculum.  Although  the  study  was  limited 
to  only  two  groups  it  showed  the  effectiveness  of  the  discriminant 
analysis  for  classifying  groups  as  compared  with  regression  analysis. 

Tiedeman,  Bryan,  and  Rulon  (1953)  investigated  the  utility  of  the 
Airman  Classification  Battery  for  assignment  of  airmen  to  eight  air 
force  specialities  using  the  method  of  calculating  the  multiple  dis¬ 
criminant  function  developed  by  Bryan.  Since  airmen  were  already  members 
of  the  Air  Force  the  problem  was  to  assign  each  airman  to  a  job  which 
maximized  satisfaction  with  the  total  assignment.  The  seventeen  stanine 
sub-scores  of  the  Airman  Classification  Battery  AC-1  used  in  the  study 
were:  clerical  key,  mechanical  key,  crafts  key,  equipment  operator  key, 

radio  operator  key,  word  knowledge,  arithmetic  reasoning,  dial  and  table 
reading,  numerical  operations  II,  aviation  information,  background  for 
current  affairs,  electrical  information,  mechanical  principles,  general 
mechanics,  tool  functions,  speed  of  identification,  memory  of  landmarks. 
Assignment  to  a  technical  training  school  was  based  on  the  choices  of  the 
airmen  as  expressed  during  the  career  counseling  interview  within  re¬ 
strictions  of  aptitude  indices,  quota,  and  personnel  supply.  The  eight 
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technical  specialities  selected  for  the  study  were:  radio  operator, 
clerk- typis t ,  control  tower  operator,  aircraft  and  engine  mechanic, 
radio  mechanic,  weather  observer,  airplane  sheetmetal  worker,  and  radar 
mechanic.  After  two  years  of  technical  training,  6105  airmen  who  were 
considered  successful  for  having  received  an  average  grade  of  2.5  or 
better  were  selected  as  subjects  in  this  study.  Their  scores  in  the 
Classification  Battery  were  used  in  the  discriminant  analysis  and  com¬ 
putation  of  centour  scores. 

The  results  showed  that  all  information  concerning  the  separation 
of  the  eight  17-variable  centroids  could  be  described  by  a  two-dimen¬ 
sional  discriminant  space.  Centour  scores  for  all  groups  were  computed 
and  used  as  a  basis  for  the  construction  of  a  reference  centour  table. 
The  size  of  the  latent  roots  and  the  overlap  of  group  distributions 
showed  that  the  Airman  Classification  Battery  achieved  a  rather  small 
amount  of  group  separation.  This  indicated  that  misclassif ication  would 
have  been  large  for  a  considerable  number  of  airmen  if  this  battery  was 
used  for  airmen  assignment.  The  results  of  the  analysis  were  not 
validated  on  check  or  new  samples.  The  emphasis  of  this  study  was  to 
show  the  use  of  discriminant  analysis  and  centours  for  personnel  class¬ 
ification  and  placement  problems.  Further  investigations  were  still 
needed  in  identif iying  other  variables  and  other  a.  priori  classification 
of  groups  which  may  show  a  better  separation  of  groups. 

Stinson  (1958)  conducted  a  discriminant  analysis  study  of  en¬ 
gineering  students  at  Oklahoma  State  University  involving  four  variables 
and  three  groups.  The  four  variables  used  were  the  total  scores  on  the 
ACE,  the  Verbal  Comprehension  and  General  Reasoning  sub-scores  of  the 
Guilf ord-Zimmerman  Aptitude  Survey  and  the  Scientific  Scale  of  the  Kuder 
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Preference  Record.  The  three  groups  differentiated  were  engineering 
graduates,  non-engineering  graduates,  and  dropouts.  Each  group  consisted 
of  thirty  randomly  selected  subjects  who  took  the  test  five  years  before. 
The  purpose  of  the  study  was  to  determine  the  critical  discriminant 
function  scores  (v-scores)  of  the  three  training  categories  of  students. 
Using  the  mean  values  of  the  tests  of  each  category,  the  discriminant 
function  scores  were  computed  three  times.  The  predicted  v-score  of 
engineering  graduates  was  2.048146,  the  non-engineering  graduates  was 
1.673095,  and  the  dropout  was  1.509139.  Midway  between  the  predicted 
v-scores  were  the  values  which  were  considered  critical  scores.  These 
critical  scores  were  then  considered  as  the  basis  for  predicting  the 
individual’s  pattern  of  measured  traits.  If  the  predicted  v-score  of 
a  student  was  greater  than  1.860621,  his  pattern  of  measured  traits 
resembled  that  of  the  engineering  graduate.  A  v-score  between  1.591117 
and  1.860621  was  taken  to  indicate  that  the  student  was  more  like  those 
who  transferred  to  another  college.  A  v-score  of  below  1.591117  prob- 
bly  indicated  the  pattern  of  measured  traits  of  a  dropout.  This  study 
was  intended  to  show  a  method  for  counseling  engineering  students  using 
the  discriminant  coefficients  as  weights  in  calculating  the  predicted 
discriminant  function  score  of  an  individual.  The  number  of  test  var¬ 
iables  and  number  of  cases  studied  were  quite  limited  and  further 
validation  was  needed  from  a  check  or  fresh  sample.  However,  the  study 
demonstrated  the  potential  uses  of  the  discriminant  analysis  in  answer¬ 
ing  the  question  of  the  student’s  membership  with  a  training  outcome 
group. 

Dunn  (1959)  investigated  the  validity  between  the  discriminant 
and  regression  methods  in  a  study  using  1380  students  at  Brown  University. 
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Her  purpose  was  to  find  out  whether  a  student’s  major  field  in  college 
was  better  predicted  by  discriminant  or  regression  analysis.  Fourteen 
fields  of  concentration  groups  and  13  variates  consisting  mostly  of 
achievement  tests  were  used  in  this  study.  An  experimental  sample  of  925 
cases  who  successfully  completed  the  bachelor  of  arts  degree  in  the  14 
fields  of  concentration  were  chosen  by  stratified  random  selection. 
Similarly  a  check  sample  of  455  cases  were  selected  in  order  to  test 
the  validity  of  the  two  techniques. 

The  discriminant  analysis  made  on  the  experimental  sample  showed 
that  the  first  two  discriminant  functions  accounted  for  86  percent  of 
the  possible  discrimination.  These  first  two  discriminants  were  used 
in  calculating  the  discriminant  weights.  Discriminant  scores  were  then 
computed  on  the  455  cases  in  the  check  sample  using  the  equations  es¬ 
tablished  in  the  analysis  of  the  experimental  samples.  Chi  squares 
were  calculated  to  obtain  the  centour  scores.  Using  the  same  13  variates 
and  14  fields  of  concentration,  multiple  regression  analysis  was  computed 
on  the  experimental  samples  using  the  grade  point  averages  in  the  fields 
of  concentration  as  the  criterion  of  success. 

The  results  of  the  calculation  of  discriminant  weights  and  re¬ 
gression  weights  showed  that  variables  found  as  useful  predictors  in 
identifying  distinct  groups  in  the  discriminant  analysis  were  quite 
different  from  variates  found  as  useful  predictors  of  success  in  the 
regression  analysis.  The  variables  which  best  separated  the  chemistry 
field  from  other  fields  in  discriminant  analysis  were  secondary  school 
rank,  mathematics,  and  science  achievement  tests.  While  these  three 
measures  were  important  for  success  in  chemistry,  the  regression  analy¬ 
sis  showed  that  verbal  aptitude  and  reading  ability  had  high  weights 
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instead.  These  contrasting  differences  in  usefulness  of  variables  as 
predictors  showed  in  the  various  fields  of  concentration  in  the  two 
analyses . 

In  order  to  verify  the  effectiveness  of  prediction  of  the  two 
approaches,  two  discriminant  scores  and  14  grade  predictions  were  com¬ 
puted  and  applied  to  455  students  in  the  check  sample.  For  each  subject 
chi  squares  were  computed  from  his  discriminant  scores  and  the  three 
groups  which  he  was  most  likely  a  member  was  determined.  Also,  the  three 
highest  predicted  averages  computed  from  the  regression  weights  were 
selected  for  each  student  in  the  check  sample.  The  predictions  by  the 
two  methods  were  tabulated  as  hits,  near  hits,  and  misses.  From  these 
results  Dunn  concluded  that  the  prediction  from  discriminant  scores  was 
far  superior  than  regression  scores.  Dunn  pointed  out  that  using  re¬ 
gression  scores  might  lead  to  a  choice  of  a  major  field  in  which  the  student 
lacked  certain  abilities  to  be  successful.  Also,  it  might  lead  to  a 
choice  in  which  his  strongest  abilities  would  not  be  fully  utilized. 

Dressel  in  an  appendage  to  Dunn's  article,  commented  that  dis¬ 
criminant  and  regression  procedures  offered  two  different  purposes  of 
choices.  Regression  analysis  in  this  study  was  made  to  do  something 
which  it  was  not  intended  to  do.  Dressel  did  not  quite  agree  with  Dunn's 
suggestion  to  abandon  the  multiple  regression  procedure.  Dressel  ex¬ 
plained  that  it  was  possible  that  grade  expectation  was  a  normal  sit¬ 
uation  which  students  encounter  in  their  school  life.  There  were 
students  who  entered  a  field  because  they  were  sure  of  getting  high 
marks  and  this  was  usually  reasonable  with  certain  conditions.  Dressel 
also  raised  the  social  implications  involved  on  this  matter  and  asked 
whether  we  were  after  status  quo  or  upgrading  of  the  person  in  the  field. 


, 


25 


He  maintained  that  the  multiple  regression  analysis  "holds  the  possibility 
of  uprading  quality  of  persons  if  this  appears  to  be  desirable."  A 
comparison  between  these  two  procedures  was  explained  by  Rulon  (1951)  in 
a  meeting  to  the  American  Psychological  Association  at  Pennsylvania  State 
College  pertinent  portions  of  which  were  quoted  below: 

The  discriminant  analysis  does  what  the  multiple  cor¬ 
relation  approach  does  not.  It  uses  group  membership  as  the 
criterion  and  makes  all  comparisons  between  groups  and  none 

within  the  groups . the  multiple  correlation  technique 

applied  separately  to  several  groups  and  the  multiple  discrim¬ 
inant  technique  applied  to  the  several  groups  simultaneously. 


Clearly  the  multiple  correlation  technique  applied  to  one 
group  ignores  all  the  data  from  other  groups.  The  multiple  dis¬ 
criminant  technique  employs  all  the  data  from  all  the  groups. 

. . .  The  multiple  discriminant  technique  requires  that  the  same 
kinds  of  measurements  be  made  on  all  the  members  of  all  the 
groups.  ... 

The  multiple  correlation  technique  does  not  require  that 
the  same  tests  be  used  in  all  groups  and  this  is  an  advantage 
of  a  sort,  but  it  means  that  there  is  no  basis  for  between- 
groups  comparisons  [p4  88]. 

Tiedeman  (1951)  pointed  out  that  discriminant  analysis  is  not  a 
replacement  of  regression  analysis.  These  two  types  of  analysis  were 
useful  for  two  different  types  of  problems.  Discriminant  analysis  was 
useful  in  answering  the  question  of  a  person's  resemblance  to  certain 
groups  while  regression  analysis  helped  answer  the  question  of  "What 
am  I  best  at?"  Either  one  of  them  could  be  a  powerful  tool  to  test  a 
theory . 


Calia  (1960)  investigated  the  use  of  discriminant  analysis  in 
the  prediction  of  scholastic  performance.  He  was  interested  in  the  gen¬ 
eral  problem  of  academic  survival  versus  attrition.  He  used  the  pre¬ 
admission  and  first  semester  data  of  the  entering  class  of  1957  at  Boston 
University  Junior  College  freshmen  as  the  independent  variables  for  the 
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prediction  of  membership  in  one  of  three  academic  groups  namely: 
scholastic  failures,  terminal  prospects,  and  transfer  candidates.  While 
ideally  the  freshman  data  of  the  following  year  should  have  been  used 
as  cross-validation  sample,  the  discriminant  analysis  data  obtained  in 
this  1957  sample  were  used  to  predict  the  group  membership  of  students 
from  the  preceding  year’s  class  to  obviate  waiting  for  a  year  before  the 
number  of  hits  and  misses  could  be  tallied.  Four  analyses  were  carried 
out  in  variables  assembled  in  four  batteries  instead  of  all  33  variables 
analyzed  simultaneously.  Eight  variables  of  the  Differential  Aptitude 
Tests  constituted  the  first  battery;  ten  variables  from  the  Kuder  Pre¬ 
ference  Record  made  up  the  second  battery;  six  nonintellectual  variables 
from  the  Survey  of  Study  Habits  and  Attitude  Scores  and  of  two  Jervis 
Self-Description  Inventory  Scores  made  up  the  third  battery;  and  nine 
variables  from  high  school  achievement  and  activity  participation  in- 
dicies,  grade  point  index,  Cooperative  English  Tests  in  Vocabulary,  Speed, 
and  Level,  Otis  Gamma  Form  AM,  Scholastic  Aptitude  Test  in  Verbal  and 
Mathematics  made  up  the  fourth  battery.  The  first  three  analyses  in¬ 
volved  only  males  while  the  fourth  analysis  involved  both  sexes. 

The  results  showed  that  only  one  discriminant  function  was  needed 
to  account  for  membership  among  groups  in  each  analysis.  This  indicated 
the  possibility  of  collinearity  with  the  use  of  discriminant  analysis. 

On  the  basis  of  the  Differential  Aptitude  Tests  battery  40  percent  were 
hits  on  the  failure  group,  43  percent  on  the  terminal  group,  and  42  per¬ 
cent  on  the  transfer  group.  The  Kuder  Preference  Record  more  or  less 
predicted  hits  similar  to  the  DAT  battery.  Among  the  batteries  analyzed 
the  fourth  battery,  consisting  of  high  school  achievement,  grade  point 
index,  IQ  and  aptitudes,  predicted  the  best  in  the  study  and  was  followed 
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by  the  DAT  and  Kuder.  The  non- intellectual  battery  was  not  an  effective 
predictor  although  it  was  a  promising , predictor  of  membership  with  the 
DAT  and  Kuder.  Battery  3  and  4  showed  as  better  predictors  for  the  high 
achievers  than  for  the  low  achievers.  The  high  relative  weight  of  grade 
point  index  as  a  variable  indicated  that  the  nature  of  one’s  scholastic 
achievement  was  what  matters  in  discriminating  among  failure,  terminal, 
and  transfer  groups.  For  the  Boston  University  Junior  College  freshmen, 
academic  performance  was  highly  related  to  scholastic  aptitude  and  for 
science  and  humanities,  interest  represented  a  factor  essential  to  success. 

Tiedeman  commented  in  an  appendage  to  Calia’s  article  that  the 
predominant  discrimination  of  grade  point  index  in  this  study  was  not 
surprising  when  one  analyzed  groups  in  relation  to  the  variable  primarily 
defining  them.  The  result  of  collinearity  of  centroids  would  seem  in¬ 
evitable. 

Cass  and  Tiedeman  (1960)  conducted  a  study  on  vocational  develop¬ 
ment  and  election  of  a  high  school  curriculum.  Th^ wanted  to  investigate 
the  reflections  of  self  in  vocational  development  as  evidenced  through 
school  curriculum  choices  using  variables  such  as  scores  of  the  Otis 
Quick-Scoring  Mental  Ability  Tests  Beta  Form  CM,  the  ten  sub-scores  of 
the  Kuder  Preference  Record,  the  Number  and  Names  Tests  of  the  Minnesota 
Clerical  Tests,  the  Minnesota  Paper  Form  Board,  age,  sex,  family  income 
as  estimated  from  median  annual  pay  in  the  state  of  Maine  based  on  the 
1950  census.  Data  were  taken  from  884  grade  nine  pupils  in  nine  second¬ 
ary  schools  in  the  state  of  Maine  in  the  fall  of  1951.  In  the  state  of 
Maine,  pupils  after  completing  grade  9  may  elect  any  one  of  the  follow¬ 
ing  high  school  curricula:  college,  general,  commercial,  home  economics, 
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industrial  arts,  and  agriculture.  After  the  completion  of  grade  10  those 
pupils  who  took  the  tests  and  earned  a  grade  average  of  75  or  above  in 
the  final  grades  in  all  subjects  were  included  in  the  analysis.  The 
number  of  cases  analyzed  finally  reduced  to  466  due  to  attrition  and  lack 
of  success.  For  cross-validation  152  of  the  cases  were  selected  by 
stratified  random  sampling. 

The  results  of  the  discriminant  analysis  and  Rao’s  significance 
test  showed  that  3  significant  roots  accounted  for  95%  of  the  total  dis¬ 
criminating  variations  of  the  six  groups  as  measured  by  18  variables. 

The  first  discriminant  function  indicated  that  sex  was  important  in  cur¬ 
riculum  choice  and  clearly  differentiated  boys  and  girls  in  the  two 
vocational  education  areas  as  interpreted  from  the  weight  of  this 
variable  and  from  the  relative  projections  of  the  group  centroids  in 
function  1.  Similarly,  the  second  function  indicated  that  age  for 
grade,  family  income,  and  academic  orientations  differentiated  the 
college  group  from  the  rest.  The  third  function  differentiated  inter¬ 
ests  in  outdoor  and  computational  activities  among  the  boys  in  the  two 
vocational  education  areas.  Curriculum  choices  were  better  differentiated 
by  7  of  the  Kuder  interest  scales  as  shown  by  the  high  weights  in  these 
variables.  The  choices  were  also  differentiated  by  the  Minnesota  clerical, 
Bennett  mechanical,  scholastic  aptitude,  age,  family  income,  and  sex. 

In  this  study  aptitude  showed  a  minor  role  in  determining  curriculum 
choice. 

Generally,  the  hit  rate  in  the  check  samples  was  about  50%.  To 
predict  exact  curriculum  choices  only  about  one-sixth  of  the  cases  could 
be  considered  hits.  To  predict  college,  general,  or  male  or  female  pro¬ 
gram  of  vocational  education  about  one-third  of  the  cases  could  be 
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considered  hits.  The  study  indicated  that  in  the  early  high  school 
life  of  the  pupil  choices  of  a  curriculum  was  influenced  by  sex  role, 
family  income,  age  at  that  grade,  and  interests,  and  less  influenced  by 
aptitudes.  As  pupils  entered  high  school  they  were  little  aware  of 
aptitudes.  Awareness  of  aptitudes,  as  measured  by  psychological  tests, 
matured  late  (O'Hara,  1958;  Madaus  &  O'Hara,  1967)  and  was  better  com¬ 
prehended  in  the  senior  year  of  high  school.  The  development  of  general 
values  seemed  to  occur  throughout  the  high  school  period. 

One  comprehensive  study  on  the  prediction  of  membership  of  in¬ 
dividuals  in  certain  groups  was  that  conducted  by  King  (1958)  in  predict¬ 
ing  choice  of  undergraduate  fields  of  concentration  in  Harvard  College 
using  multiple  discriminant  analysis.  The  36  predictor  variables  con¬ 
sisted  of  sub-scores  of  the  Kuder  Preference  Record  and  scores  on  the 
academic  aptitude  survey  tests  developed  in  Harvard  which  included  sub¬ 
tests  in  social  science  concepts,  social  science  reasoning,  arithmetic, 
biology,  diagram  interpretation,  spatial  relations,  science,  algebra, 
matrices,  camouflaged  designs,  verbal  production,  etc.  The  36  groups 
of  graduates  investigated  included  such  fields  as  mathematics,  physics, 
engineering,  astronomy,  chemistry,  biochemistry,  biology,  geology, 
physical  science,  psychology,  anthropology,  social  science,  economics, 
government,  history,  literature,  English,  classics,  romance  languages, 
philosophy,  etc.  The  sample  who  took  the  tests  consisted  of  521 
freshmen.  A  check  sample  of  299  cases  was  selected  for  validation. 

The  full  36  x  36  among  groups  and  within  groups  matrices  were 
used  in  the  discriminant  analysis  using  Bryan's  method  of  calculation 
with  some  modification.  Another  discriminant  analysis  was  made  on  the 
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10  largest  components  from  the  results  of  the  first  analysis.  Four 
discriminant  roots  were  found  significant  in  both  analyses  and  were  then 
used  to  determine  the  dimensionality  of  the  reduced  space.  Since  the 
results  of  both  computations  were  very  similar,  the  rest  of  the  statistical 
analysis  was  done  on  the  10  components  instead  of  the  original  thirty-six. 

Validation  with  the  check  samples  was  tabulated  to  show  direct 
hits,  near  misses,  and  misses.  The  predicted  hits  and  near  misses  sig¬ 
nificantly  exceeded  chance  hits  and  near  misses  with  a  probability  well 
below  .001-level.  The  general  conclusion  of  the  study  was  that  inter¬ 
ests  and  aptitudes  were  effective  predictors  of  undergraduate  fields  of 
concentration  at  Harvard  College.  However,  it  would  have  been  interest¬ 
ing  to  show  the  percentage  of  hits  than  just  significance  level.  The 
investigator  suggested  that  a  similar  analysis  of  a  limited  number  of 
College  Board  entrance  tests  and  other  admission  variables  in  conjunction 
with  interest  or  attitude  tests  self  administered  after  admission  but 
prior  to  registration  could  be  useful  as  guidance  information  to  freshmen 
through  their  freshmen  adviser  before  selecting  a  field  of  concentration 
after  completing  the  sophomore  year. 

Stahmann  and  Wallen  (1966)  conducted  a  multiple  discriminant 
prediction  of  the  major  field  of  study  at  graduation  for  a  sample  of 
students  at  the  University  of  Utah.  The  tests  used  were:  the  Cooperative 
English  Test  Lower  Level,  Cooperative  Mathematics  Pre-Test  for  College 
Students,  Cooperative  General  Achievement  Tests  in  Natural  Science,  Occu¬ 
pational  Interest  Inventory  Advanced  Level,  and  the  index  of  urban  or  rural 
high  school  attendance.  Random  male  samples  of  fifty  each  from  the  en¬ 
gineering,  business,  pharmacy,  letters  and  science  fields  were  selected 
to  constitute  the  experimental  groups  upon  which  the  discriminant  analyses 
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were  computed.  Another  fifty  random  samples  from  each  major  field  were 
selected  to  constitute  the  cross  validation  groups  upon  which  the  predic¬ 
tions  were  made.  The  same  procedure  was  followed  in  securing  the  random 
female  samples  for  nursing,  elementary  education,  and  letters  and  science 
fields.  The  discriminant  analyses  for  males  and  for  females  were  made 
separately . 

The  results  showed  that  for  the  male  cross  validation,  prediction 
exceeded  the  number  of  correct  classifications  expected  by  chance  for 
engineering  and  letters  and  science  but  not  for  the  business  field.  The 
results  of  the  female  cross  validation  sample  was  very  significant.  An 
analysis  was  also  made  between  predicted  and  actual  field  of  study  for 
the  male  and  the  female  experimental  samples  using  the  same  discriminant 
functions  of  the  experimental  samples.  The  results  of  these  predictions 
for  each  field  of  study  exceeded  chance  expectation.  However,  the  re¬ 
sults  of  classification  were  not  shown  in  percentage  of  hits  which  would 
have  been  more  interesting  and  meaningful  than  significant  level. 

The  investigators  concluded  that  the  freshmen  entrance  battery 
tests  at  the  University  of  Utah  was  shown  to  be  an  effective  predictor 
of  the  major  field  at  graduation.  They  suggested  that  other  variables 
be  used  other  than  achievement  and  interest  measures  for  the  business 
field  group.  This  study  was  concerned  with  academic  college  fields. 
Investigations  with  regard  to  the  prediction  in  the  technical  and  vo¬ 
cational  fields  above  the  high  school  but  below  the  college  grade  have 
seldom  been  carried  out  although  a  far  larger  number  of  young  adults 
elected  various  occupations  in  this  level. 

Another  similar  study  on  multiple  discriminant  prediction  of 
college  career  choices  was  conducted  by  Vacchiano  and  Adrian  (1966) . 
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It  explored  the  feasibility  of  using  personality  need  constructs  as 
measured  by  the  Stern’s  Activity  Index  (AI) .  This  measure  consisted  of 
300  need  items  describing  common  activities  and  feelings.  The  five 
college  groups  investigated  were  business,  chemistry,  and  mathematics 
male  students  and  education  and  nursing  female  students.  Like  the  study 
of  Stahmann  and  Wallen  about  50  students  were  randomly  selected  to  com¬ 
pose  the  criterion  group  and  about  35  students  to  compose  each  cross 
validation  group. 

The  results  showed  only  one  significant  discriminant  root  which 
accounted  for  72%  of  the  total  discriminating  variations  of  the  5  groups. 
Interpretations  were  made  on  the  need  scales  furnishing  the  greatest 
relative  contribution  in  the  different  fields  for  both  the  male  and  fe¬ 
male  groups  based  on  the  discriminant  weights  in  function  1.  For  example, 
the  business  group  showed  high  scores  for  needs  affiliation,  aggression, 
dominance,  exhibition,  fantasized  achievement,  humanism,  and  play. 

Correct  classifications  of  the  male  and  female  groups  were  above  chance 
expectations.  Sixty-five  percent  of  education,  58%  of  nursing,  54%  of 
business  and  33%  of  mathematics  students  were  correctly  predicted. 

This  study  suggested  that  prediction  of  academic  major  might  be 
feasible  using  personality  variables  as  measured  by  the  Activities 
Index  although  other  cognitive  factors  were  also  important  in  determining 
vocational  goals. 


Summary 

Discriminant  analysis,  which  originated  with  Fisher  (1936), 
was  suggested  by  Travers  (1939)  and  Garret  (1943)  for  application  in 
investigating  psychological  classification  problems.  Early  investiga¬ 
tions  using  the  discriminant  analysis  in  psychology  were  more  concerned 
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with  verifying  its  effectiveness,  in  comparing  this  procedure  with  re¬ 
gression  analysis,  and  in  finding  a  method  of  solution  for  analysis 
involving  more  than  two  groups.  The  later  studies  turned  to  identify¬ 
ing  different  types  of  predictor  variables.  Baggaley  (1947)  used  the 
sub-scores  of  the  Kuder  Preference  Record  as  predictors  for  two  groups 
of  Harvard  students.  Tiedeman  and  Steinberg  (1951)  used  the  sub-scores 
of  the  Differential  Aptitude  Tests  for  grade  nine  pupils.  Tiedeman, 

Bryan,  and  Rulon  (1953)  investigated  the  utility  of  the  Airman  Class¬ 
ification  Battery  as  predictor  variables  in  the  placement  of  Air  Force 
personnel.  The  score  on  the  ACE,  some  sub-scores  of  the  Guilford- 
Zimmerman  Aptitude  Survey  and  the  Scientific  scale  of  the  Kuder  were 
used  by  Stinson  (1958)  in  a  discriminant  analysis  study  of  engineering 
students.  Other  investigators  used  the  achievement  tests,  nonintellec¬ 
tual  variables,  intelligence  tests,  grade  point  index,  high  school 
average,  aptitude  tests,  and  personality  need  constructs.  Most  of  the 
studies  reviewed  showed  positive  results.  However,  King  (1958)  pointed 
out  the  limitation  of  the  discriminant  analysis  in  producing  a  direct 
measure  of  "satisfaction"  or  "appropriateness."  He  cautioned  users  of 
discriminant  results  regarding  direct  placement  of  an  individual  in  a 
field  of  concentration.  The  results  were  better  used  as  a  guideline 
for  the  student  in  his  election  of  a  course  to  pursue. 

One  thing  which  stood  out  in  the  various  studies  reviewed  was 
that  discriminant  analysis  was  a  useful  statistical  tool  for  psychological 
studies.  So  far  the  various  related  studies  had  investigated  samples 
of  high  school  and  college  students  but  none  of  the  type  of  young  adult 
students  pursuing  the  technical  and  vocational  fields  above  the  high 
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school  level  but  below  college  grade  in  institutes  of  technology.  These 
students  constituted  a  large  segment  of  the  future  working  population 
who  no  longer  entered  the  university  but  deserved  some  attention  with 
regard  to  assisting  them  in  choosing  suitable  occupational  training  courses. 
If  Super's  (1954)  view  of  vocational  counseling  from  the  North  American 
cultural  framework  was  accepted  as  valid,  such  assistance  was  surely 
necessary  and  important. 

If  the  findings  of  the  studies  of  O’Hara  (1958)  and  Madaus  and 
O'Hara  (1967)  that  aptitudes  were  better  comprehended  by  the  student  in 
his  senior  or  high  school  graduation  year  was  accepted,  the  use  of  the 
Differential  Aptitude  Tests  as  some  of  the  predictor  variables  in  this 
study  had  justification  for  use  in  investigating  a  population  of  students 
of  the  category  above.  As  mentioned  earlier,  some  of  these  students  had 
outside  work  experiences.  Such  work  experiences  could  also  contribute 
to  their  awareness  of  the  required  aptitudes  in  the  real  world  of  work. 
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CHAPTER  IV 


METHODS  AND  MATERIALS 

The  main  interest  in  this  study  was  to  develop  a  procedure  for 
predicting  training  outcomes  of  students  enrolled  at  Northern  Alberta 
Institute  of  Technology.  From  a  combination  of  nineteen  predictor  var¬ 
iables  the  discriminant  and  centour  classification  techniques  were  used 
to  determine  the  probability  of  a  new  student  entering  the  institute 
satisfactorily  completing  his  training  program  on  time.  The  use  of  this 
scheme  in  vocational  guidance  would  assist  the  incoming  student  consider¬ 
ably  in  choosing  a  program.  Students  might  be  counselled  to  enter  a 
program  somewhat  different  from  their  initial  choice,  if  their  probabil¬ 
ity  of  being  successful  in  another  program  was  greater.  Students  who 
were  likely  to  achieve  a  certain  unsatisfactory  training  outcome  could 
be  identified  beforehand  and  some  preventive  steps  taken  in  their  guidance 
and  instruction.  Furthermore,  this  procedure  might  be  useful  in  planning 
future  policies  of  student  selection  and  placement  at  NAIT. 

Evidence  reported  in  previous  studies  indicated  that  the  problem 
of  determining,  from  a  combination  of  predictor  variables,  which  group 
or  occupational  category  an  individual  resembles  most  could  best  be 
approached  by  multiple  discriminant  analysis  and  by  determining  his 
centour  scores.  How  these  approaches  could  be  carried  out  in  a  school 
guidance  situation  particularly  in  vocational  counselling  in  technical 
institutes  still  needed  further  study. 
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The  Predictor  Variables 

The  nineteen  predictor  variables  used  in  this  study  were  eight 
sub-scores  of  the  Differential  Aptitude  Tests;  two  scores  of  Lorge- 
Thorndike  Intelligence  Tests,  verbal  and  nonverbal;  five  scores  of  the 
Alberta  Grade  IX  Departmental  Examinations,  two  scores  of  the  School 
and  College  Ability  Tests  Level  3,  verbal  and  quantitative;  and  the 
two  high  school  averages  in  Grades  X  and  XI.  Specifically,  the  eight 
sub-scores  of  the  Differential  Aptitude  Tests  were  verbal  reasoning, 
numerical  ability,  abstract  reasoning,  clerical  speed  and  accuracy, 
mechanical  reasoning,  space  relations,  language  usage  I-spelling,  and 
language  II-grammar.  The  combination  sub-score  in  verbal  reasoning  plus 
numerical  ability  although  available  was  not  included.  This  kind  of 
combination  score  would  merely  produce  a  linear  effect  in  multivariate 
analysis  and  would  not  in  any  way  contribute  to  prediction.  The  five 
scores  of  the  Alberta  Grade  IX  Departmental  Examinations  were  reading 
and  literature,  language,  social  studies,  mathematics,  and  science.  Of 
the  nineteen  scores  used  in  this  study  the  first  ten  of  them  were 
gathered  after  the  student  had  enrolled  and  chosen  his  training  course. 
The  last  nine  scores  were  based  on  achievement  of  the  student  prior  to 
his  enrollment  at  NAIT. 

The  letter  grades  in  high  school  subjects  were  converted  by 
assigning  numerical  values  as  follows:  H=  5;  A  =  4;  B  =  3;  C  =  2; 

D  =  1.  Grades  in  percent,  used  in  some  high  school  records  were  also 
converted  as  follows:  80%  to  100%  =5;  65%  to  79%  =  4;  50%  to  64%  =  3; 
40%  to  49%  =2;  0%  to  39%  =  1.  These  particular  conversions  of  grades 
into  these  scales  was  similar  to  a  stanine  scale  distribution. 
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The  study  samples 

As  stated  earlier,  the  guidance  and  counseling  personnel  at 
NAIT  had  been  concerned  with  the  problem  of  providing  further  assist¬ 
ance  to  students  in  the  matter  of  choosing  a  training  program.  To 
secure  some  information  about  students  which  would  be  useful  in  voca¬ 
tional  counseling  the  Differential  Aptitude  Tests  and  the  Lorge-Thorndike 
Intelligence  Tests  were  given  to  all  students  who  enrolled  for  the  first 
time  in  the  various  technologies,  business,  and  vocational  courses  about 
two  months  after  the  beginning  of  the  1966-67  school  term.  These  tests 
were  administered  under  the  supervision  of  Dr.  A.  M.  Bolle,  then  head  of 
the  Guidance  and  Counseling  Services.  Students  who  took  these  tests  had 
already  chosen  and  started  in  their  respective  courses  by  the  usual  pro¬ 
cedure  prescribed  by  the  institute.  It  was  then  assumed  that  choices 
made  by  these  students  were  not  in  any  way  influenced  by  the  test  results. 

The  number  of  students  tested  and  the  courses  they  were  enrolled 
in  at  that  time  is  shown  in  Table  1.  A  total  of  1193  students  took  the 
Lorge-Thorndike  Intelligence  Tests.  However,  not  all  of  them  took  the 
Differential  Aptitude  Tests.  This  study  required  that  all  students  must 
have  had  a  complete  set  of  nineteen  scores  on  the  predictor  variables. 

One  hundred  four  students  who  missed  taking  the  Differential  Aptitude 
Tests  were  not  included  in  the  study  sample. 

The  number  of  subjects  was  further  reduced  by  two  hundred  fifteen 
due  to  unavailability  of  student  marks  in  either  Grade  X  and  XI,  the 
Alberta  Grade  IX  Departmental  Examinations,  or  the  School  and  College 
Ability  Tests  Level  3.  Some  students  were  from  other  provinces  or 
were  former  high  school  graduates  who  had  returned  to  study  after  a  few 
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Table  1.  Number  of  students  tested 
by  program  of  study 


Technology  D.A.T.  Lorge-Thorndike 


Air  conditioning  &  refrig 

Architectural 

Banking  and  finance 

Business  administration 

Chemical 

Civil 

Commercial  cooking 

Dental  assisting 

Dental  laboratory 

Dietary  services 

Distributive 

Drafting 

Electrical 

Electronics 

Exploration 

Forest 

Gas 

Heavy  duty  mechanics 
Electronic  data  process 
Industrial  production 
Instrumentation 
Material 

Medical  laboratory 

Medical  X-ray 

Office  machine  mechanics 

Photography 

Plastics 

Secretarial 

Surveying 

Telecommunications 


8 

8 

53 

60 

19 

19 

95 

102 

69 

77 

47 

54 

17 

17 

28 

34 

15 

18 

20 

21 

34 

53 

36 

47 

22 

24 

158 

166 

30 

30 

55 

56 

17 

18 

15 

15 

34 

40 

10 

11 

25 

25 

18 

19 

75 

75 

32 

32 

15 

15 

20 

22 

14 

14 

56 

58 

11 

12 

41 

51 

Totals 


1089 


1193 
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years  of  absence  from  school.  Scores  and  marks  for  these  students  could 
not  be  secured. 

Finally,  only  868  students  were  included  in  this  study.  The 
graduates  were  grouped  into  nine  training  fields  as  follows: 


Code 

Training  field 

Specific  technology  enrolled 

1 

Control  system 

Air  conditioning  &  refrig. 
Electronic  data  processing 
Gas 

Instrumentation 

2 

Industrial 

Heavy  duty  equipment 
Industrial  production 

Office  machine 

Plastic 

3 

Natural  resources 

Exploration 

Forest 

4 

Engineering 

Civil 

Material 

Surveying 

5 

Plans  and  designs 

Architectural 

Drafting 

Photography 

6 

Laboratory 

Chemical 

Medical  laboratory 

7 

Health 

Dental  assisting 

Dental  laboratory 

Dietary  services 

Medical  X-ray 

8 

Commercial 

Banking  and  finance 

Business  administration 
Distributive 

Secretarial 

9 

Electrical- 

electronics 

Electrical 

Electronics 

Telecommunications 

In  classifying  the  specific  technologies  into  the  various  training 
fields  the  similarities  of  aptitudes  demanded  in  the  technology  and  the 
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characteristics  of  the  working  environment  were  used  as  bases  of  con¬ 
sideration.  The  assumption  was  that  an  individual  tended  to  seek 
occupational  classes  whose  members  have  a  similar  personal  orientation 
to  his  own.  The  studies  of  Holland  and  associates  (1966)  on  vocational 
choice  demonstrated  findings  which  supported  this  tendency.  Some  people 
preferred  to  seek  outdoor  rather  than  indoor  work,  working  among  people 
rather  than  working  alone,  structured  tasks  rather  than  ambiguous  tasks, 
concrete  rather  than  abstract  problems,  verbal  rather  than  nonverbal 
activity,  etc.  The  description  of  skills  and  knowledge  required  of  each 
technology,  as  described  in  the  institute’s  calendar,  was  studied.  Com¬ 
parisons  were  made  among  the  various  technologies  to  arrive  at  an  approp 
riate  matching  of  similar  aptitude  demands  and  environment  orientations 
of  the  various  training  fields.  The  experiences  and  opinions  of  the 
Director  of  Instruction  of  NAIT  were  also  sought  in  deciding  the  final 
classification  of  training  fields. 

On  the  basis  of  actual  training  outcomes  at  the  close  of  the 
1967-68  school  term,  that  was  two  years  later,  students  within  the 
classifications  described  above  were  categorized  as  (a)  graduates,  i.e. 
those  who  graduated  in  their  respective  courses  on  time  or  successfully 
completed  the  second  year  of  their  three-year  course  and  (b)  non-grad¬ 
uates,  i.e.  those  who  failed  to  meet  the  graduation  requirements, 
transferred  or  withdrew  before  completing  a  course.  Graduation  on  time 
or  successful  completion  meant  the  student  passed  all  subjects  with  a 
grade  of  at  least  50%. 

The  graduates  were  classified  into  9  training  fields  and  a  strati 
fied  random  check  sample  was  drawn  and  set  aside  for  purposes  of 
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validation.  The  breakdown  of  criterion  and  validation  samples  classified 
by  training  field  and  technology  is  shown  in  Table  2.  The  number  of 
non-graduates  who  failed  and  transferred  and  withdrew  classified  by  tech¬ 
nology  is  shown  in  Table  3. 

Statistical  procedures 

The  major  part  of  the  statistical  analysis  in  this  study  was 
carried  out  on  an  IBM  360/67  electronic  computer  which  was  available 
at  the  University  of  Alberta  Computing  Center.  The  investigator  sought 
the  assistance  of  the  Division  of  Educational  Research  Services  of  the 
University  in  connection  with  the  preparation  of  the  FORTRAN  programs 
for  multiple  discriminant  analysis  and  centour  score  classification. 

To  test  the  null  hypothesis  (1)  that  there  are  no  significant 
differences  in  group  means  in  each  of  the  nineteen  predictor  variables, 
a  one-way  analysis  of  variance  was  carried  out  on  the  criterion  sample 
of  graduates. 

Five  discriminant  analyses  were  done  on  the  sample  to  determine 
the  most  effective  combination  of  training  groups  and  predictors  using 
MULV10  FORTRAN  program^  (D.E.R.S.,  1968).  The  results  derived  in  each 
analysis  were  used  in  computing  the  discriminant  scores,  chi  square 
values  and  the  corresponding  conditional  and  Bayesian  probabilities  and 
group  classification  for  each  of  the  graduate  criterion  sample  and 
validation  sample.  Comparisons  of  results  were  made  in  terms  of  the 
number  of  predicted  correct  classifications,  close  classifications, 
fairly  close  classifications,  misclassif ications ,  and  the  corresponding 
percentages.  By  close  classification  was  meant  that  centour  score  or 
probability  which  was  next  to  the  correct  classification  and  fairly  close 
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Table  2.  Criterion  and  validation  samples  of  graduates 
classified  by  training  field  and  technology 


Code 

No. 

Training 

field 

Technology  in  this 
classification 

Criterion 

samples 

Vali¬ 

dation 

samples 

Total 

1 

Control 

systems 

Electronic  data 
processing 

Gas 

Instrumentation 

30 

7 

37 

2 

Industrial 

Heavy  duty  equip. 
Indust,  production 
Office  machines 
Plastics 

26 

6 

32 

3 

Natural 

resources 

Exploration 

Forest 

34 

8 

42 

4 

Engineering 

Civil 

Materials 

Surveying 

35 

8 

43 

5 

Plans  and 
design 

Architectural 

Drafting 

Photography 

36 

9 

44 

6 

Laboratory 

Chemical 

Medical  labora¬ 
tory 

54 

13 

67 

7 

Health 

Dental  assisting 
Dental  laboratory 
Dietary  service 
Medical  X-ray 

46 

11 

57 

8 

Commercial 

Banking  &  finance 
Business  admin. 
Distributive 
Secretarial 

56 

14 

70 

9 

Electrical- 

electron¬ 

ics 

Electrical 

Electronics 

Telecommunications 

55 

14 

69 

Total 

371 

90 

461 

) 
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Table  3.  Non-graduates  who  failed  and  transferred  and 
withdrew  classified  by  technology 


Technology  enrolled 

Failed  and 
Transferred 

Withdrew 

Total 

Airconditioning  and 


refrigeration 

Electronic  data 

2 

3 

6 

processing 

5 

10 

15 

Gas 

5 

3 

8 

Ins  trumentation 

Heavy  duty  equipment 

12 

1 

13 

mechanic 

2 

1 

3 

Industrial  production 

2 

2 

4 

Office  machine  mechanic 

3 

0 

3 

Plastics 

1 

1 

2 

Exploration 

0 

2 

2 

Forest 

6 

4 

10 

Civil 

1 

8 

9 

Materials 

1 

5 

6 

Surveying 

2 

2 

4 

Architectural 

17 

17 

34 

Drafting 

5 

0 

5 

Photography 

0 

6 

6 

Chemical 

10 

19 

29 

Medical  laboratory 

3 

12 

15 

Dental  assisting 

4 

2 

6 

Dental  laboratory 

2 

1 

3 

Dietary  services 

2 

2 

4 

Medical  X-ray 

3 

1 

6 

Banking  and  finance 

7 

2 

9 

Business  administration 

22 

21 

43 

Distributive 

6 

11 

17 

Secretarial 

5 

26 

31 

Electrical 

6 

7 

13 

Electronics 

46 

36 

82 

Telecommunications 

5 

11 

16 

Totals 


187 


216 


403 
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classification  was  that  centour  score  or  probability  which  was  next  to 
the  close  classification. 

The  first  analysis  and  classification  involved  9  training  groups 
and  19  predictors.  The  classification  computation  in  this  analysis  and 
in  the  second  analysis  were  done  twice,  first  by  using  the  common  dis¬ 
persion  (MULV11)  and  the  other  by  using  the  separate  dispersion  of  each 
group  (MULV12) ,  (D.E.R.S.,  1968).  These  two  computations  were  made  in 

order  to  verify  whether  the  results  of  both  classification  solutions 
were  similar.  These  comparisons  were  necessary  because  too  wide  dif¬ 
ferences  in  group  dispersions  would  affect  the  effectiveness  of 
classification. 

The  extent  of  overlaps  between  groups  in  the  first  analysis  was 
examined.  Two  of  the  groups  which  had  a  considerable  overlap  were  com¬ 
bined  and  a  second  analysis  and  classification  was  carried  out  involving 
8  training  groups  and  19  predictors  to  find  out  whether  this  procedure 
would  improve  the  effectiveness  of  classification. 

The  discriminant  weights  of  the  predictors  derived  in  the  first 
analysis  were  also  examined  to  identify  predictors  which  probably  con¬ 
tributed  most  to  differentiation  of  training  groups.  Ten  predictors 
with  high  discriminant  weights  were  selected  and  used  in  a  third 
analysis  and  classification  involving  the  same  9  training  groups.  The 
results  of  this  analysis  were  compared  with  those  in  analysis  1  to  ver¬ 
ify  whether  10  predictors  would  differentiate  training  groups  as  effec¬ 
tively  as  19  predictors. 

To  further  compare  the  classification  results  with  those  of  the 
first,  second,  and  third  analyses,  a  fourth  analysis  and  classification 
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was  computed  involving  the  8  training  groups  and  the  selected  10  pre¬ 
dictors  with  high  weights. 

At  NAIT  the  scores  of  12  of  the  predictors  in  this  study  were 
easily  available  to  vocational  counselors.  Except  for  Math.  9  (Alberta) 
and  SCAT  3  (quantitative) ,  these  readily  available  12  predictors  con¬ 
sisted  of  the  10  predictors  used  in  analysis  4  plus  DAT  LU  I  (spelling) 
and  Lorge-Thorndike  (NV) .  To  determine  whether  these  predictors  would 
be  comparatively  useful  for  counseling,  a  fifth  analysis  and  classific¬ 
ation  was  carried  out  involving  the  8  training  groups  and  these  predictors. 
The  results  of  classification  in  this  analysis  were  compared  with  analysis 
2  and  4. 

Further  verification  of  results  in  analysis  2  was  made  by  applying 
the  discriminant  weights,  group  centroids,  and  common  dispersion  matrices 
derived  from  this  analysis  in  computing  centours,  probabilities,  and 
classifications  of  403  non-graduate  students.  Although  it  was  possible 
that  some  non-graduates  would  be  classified  as  graduates  since  reasons 
for  non-graduation  were  not  always  the  lack  of  aptitude  and  ability 
for  technical  training,  the  results  of  this  cross-classification  would 
provide  additional  information  of  the  extent  to  which  this  prediction 
scheme  could  identify  non-graduates  from  graduates.  It  would  also  pro¬ 
vide  assurance  on  the  applicability  of  using  the  results  derived  in 
analysis  2  for  counseling  at  NAIT. 

Besides  the  analyses  and  classifications  described  above  it  was 
felt  necessary  to  investigate  the  question  whether  the  19  predictors 
could  differentiate  between  two  broad  categories  of  students,  i.e.  grad¬ 
uates  and  non-graduates.  As  a  preliminary  step  in  the  counseling  process 
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a  counselor  would  certainly  want  to  know  whether  an  incoming  student 
is  likely  to  graduate  from  the  institute  or  not  before  taking  up  with 
the  student  the  question  of  training  programs  to  choose  from.  Since 
the  counselors  and  department  heads  were  involved  in  deciding  the 
admission  of  new  students,  information  on  this  question  would  be  val¬ 
uable  in  the  selection  process. 

Three  hundred  twenty-three  graduates  and  282  non-graduates  were 
used  in  carrying  out  a  discriminant  analysis  of  two  group  categories 
and  19  predictors.  The  discriminant  weights  obtained  in  this  analysis 
were  then  applied  in  computing  the  centours,  probabilities,  and  class¬ 
ifications  of  138  validation  sample  of  graduates  and  121  non-graduates 
selected  by  stratified  random  sampling. 

Multiple  discriminant  analysis  calculation.  Bryan  (1950)  derived 
the  multiple  discriminant  function  matrix  equation  as: 

(W_1A  -  A  I)v  =  0  (1) 

where  W  is  the  within-group  sums  of  squares  matrix; 

A,  the  among-group  sums  of  squares  matrix; 

A,  the  latent  root; 

I,  the  identity  matrix; 

and  v,  the  latent  vector.  Zero,  on  the  right  hand  side  of  equation  (1) 
represents  a  column  vector  of  zeros.  Equation  (1)  is  similar  in  form  to 
the  Kelly  or  the  Hotelling  principal  axis  method  of  factor  analysis  and 
is  solved  in  a  similar  manner  except  that  W  ^A  is  not  symmetric. 

The  characteristic  equation  of  W  ^A  is  given  by  the  determinantal 
equation : 

|  W_1A  -  A  I |  =  0  (2) 

The  solution  of  the  latent  roots  of  W  1A  in  equation  (2)  satisfies  the 
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discriminant  criterion,  that  is,  of  providing  a  maximum  value  of  re¬ 
spective  ratios  of  among-group  to  within-group  sums  of  squares  along  the 
discriminant  vectors. 

The  latent  vectors  of  W  are  solved  by  substituting,  in 
equation  (1) ,  the  value  of  the  latent  roots  obtained. 

These  latent  vectors  represent  the  corresponding  coefficients  of 
the  discriminant  functions  or  simply  the  discriminant  weights. 

In  solving  equation  (2),  Bryan  (1950)  explained  that  the  number 
of  latent  roots  A,  which  are  equal  to  0,  is  at  most  equal  to  the  number 
of  groups  less  1  or  equal  to  the  number  of  variates  whichever  is  smaller. 
Designating  this  smaller  number  as  _r,  the  total  discriminating  power  of 
the  variates  are  exhausted  by  _r  linear  functions  defined  in  this  manner. 
All  discriminant  functions  corresponding  to  the  distinct  values  of  the 
latent  roots  A  are  uncorrelated  as  they  stand.  The  general  description 
of  the  multiple  discriminant  analysis  (Tiedeman,  Bryan,  and  Rulon, 

1959)  is  quoted  in  full  in  Appendix  I,  pp.  142. 

The  calculation  routine  of  multiple  discriminant  analysis  is 
as  follows: 

Let  be  the  score  for  person  p  in  group  g  on  test  i.  Let  k 

be  the  number  of  groups,  n  the  number  of  people  in  group  g  and  m  the 

§ 

number  of  tests.  Let  N  be  the  total  number  of  people.  Define  a  score 

matrix  X  of  order  N  x  m  of  elements  x  .  as  follows : 

pgi 
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tests 


group  1 


group  2 


X 

(N,m) 


group  g 


xlll 

X112 

•**  Xllm 

X211 

X212 

X21m 

x  ..  . 
pli 

Xn^lm 

X121 

X122 

•"  X12m 

X221 

X222 

**•  X22m 

x  0  . 

p2i 

Xn22m 

g  =  1, 

2 ,  . .  .  ,  k 

xigi 

Xlg2 

•  •  •  X- 

lgm 

P  =  1> 

2 •  • • • •  n  •• 

g 

X2gl 

X2g2 

•  •  •  x 

2gm 

i  =  1, 

2  j  •  •  •  ^  m 

x 

Pgi 

X 

ngm 

XN.  km 
k 

1.  x  is  the  grand  mean  score  for  test  i. 

•  •  i 

2.  The  N  x  m  total  deviation  score  matrix  for  all  groups 

S  with  elements  x  -  x  .  has  the  following  form: 

pgi  ..i 


. 
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S 

(N,m) 


xlll  x..l 

X112 

X  2 

•••  xllm 

X 

• 

.m 

x2ir  x..i 

X212" 

X  2 

X21m" 

X 

• 

.m 

x  ,  .  - 
pli 

x 

.  .1 

Xn^lm 

X 

• 

.m 

X12l"  x..l 

x122" 

x 

X12m" 

X 

• 

.m 

X221~  x..l 

x222" 

x  2 

•  • 

*•*  X22m" 

X 

• 

.m 

Xp21~ 

X 

,  .1 

Xn22m 

X 

• 

.m 

X 

X 

Pgi 

.  .1 

x..  ,  -  x 
N,  km 
k 


.m 


3.  The  m  x  m  total  sums  of  squares  score  matrix  T 

is  the  matrix  product  of 

T  =  S'  .  S 
(m,m)  (m,N)  (N,m) 

4.  The  mean  x  .  is  the  mean  score  of  test  i  for 

•  gi 

group  g. 

5.  The  N  x  m  within  group  deviation  score  matrix 

S  with  elements  x  x  .  has  the  following  form: 

w  pgi  .gi 
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S 

w 

(N  ,m) 


xlll 

X 

• 

11 

X112~ 

X 

• 

,12 

• 

• 

• 

X 

H-* 

i _ , 

3 

1 

X 

« 

,  lm 

x2ii" 

X 

• 

11 

X212“ 

X 

« 

,12 

*  •*  X21m 

X 

• 

,  lm 

Xpli" 

X 

« 

,li 

Xn1lm 

X 

4 

,  lm 

X121' 

X 

• 

21 

X122“ 

X 

4 

,22 

•**  X12m" 

X 

4 

,  2m 

X221_ 

X 

• 

21 

X222_ 

X 

4 

.22 

*•*  X22m" 

X 

« 

.  2m 

x  - 

p2i 

X 

( 

,2i 

Xn22m- 

X 

• 

,  2m 

x  -  x 
Pgi  -gi 


XN,  km  X. km 
k 


6.  The  m  x  m  within-group  sums  of  squares  matrix  W 


is  the  matrix  product  of 


W 

(m,m) 


S' 

w 


(m,N) 


S 

w 

(N,m) 


7.  The  m  x  m  among-group  sums  of  squares  matrix  A 


is  the  difference  of 


A 

(m,m) 


T 

(m,m) 


W 

(m,m) 


8.  The  latent  roots  X  are  obtained  using  the  deter- 


minantal  equation 


W  1A  -  X  I 


=  0 


9.  The  latent  vectors  v  are  computed  using  the 


discriminant  matrix  equation 


(W  XA  -  X  I)v  = 


0 


■ 
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10.  The  latent  vectors  are  normalized  by  multiplying 

by  1/ (  Z  v^)2  ;  i  (variable)  =  1,  2,  m 

i=l 

The  program  used  also  calculated  the  intercorrelation  of  the 
nineteen  variables,  the  standard  deviations  of  the  variates  in  each 
group  and  produced  punched  cards  containing  the: 

1)  means  of  each  variate  in  each  group; 

2)  group  dispersion  matrices; 

3)  total  dispersion  matrix;  and 

4)  transformation  matrix  (latent  vectors  of  W  ^A)  . 

The  punched  cards  were  later  used  in  the  classification  solution  of 
the  criterion  and  validation  samples. 

The  program  also  included  the  computation  of  significance  tests. 

A  test  of  significance  of  the  discriminating  power  of  the  variates  was 
calculated  based  on  the  Wilk’s  lambda  criterion  and  using  the  F-ratio 
approximation  developed  by  Rao  (1952). 

The  test  of  the  dimensionality  of  the  discriminant  space  was 
computed  using  the  chi  square  approximation  also  developed  by  Rao  (1965) . 
The  discriminant  functions  corresponding  to  the  latent  roots  found  sig¬ 
nificant  at  .05-level  served  as  the  basis  for  determining  the  dimension¬ 
ality  of  the  reduced  test  space.  The  corresponding  coefficients  of  these 
discriminant  functions  were  then  used  as  the  transformation  matrix  in 
computing  individual  discriminant  scores. 

Classification  solutions .  Two  programs  were  used  in  the  com¬ 
putation  of  centours ,  probabilities,  and  classifications  of  the  criterion 
and  validation  samples  in  this  study.  One  of  these,  MULVll  was  a  solution 
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using  the  common  dispersion  of  all  groups.  The  calculation  routine  of 
this  program  is  as  follows: 

Let  x  .  be  the  group  mean  score  vector  of  group  g  on  test  i, 

.gi 

T  the  m  x  m  total  dispersion  matrix,  and  V  ^  the  m  x  r  transformation 
matrix  (latent  vectors  of  W  ^A)  of  test  i  in  discriminant  function  f. 
The  data  above  are  obtained  from  the  results  of  the  discriminant 
analysis.  Let  m  be  the  number  of  tests,  r  the  number  functions  in  the 
reduced  discriminant  space,  and  N  the  total  number  of  persons. 


1.  The  discriminant  score  vector 
f  in  the  reduced  space  is 


*p£ 


for  person  p  in  function 


*p£ 

(l,r) 


X  . 

-pi 

(l,m) 


hf 

(m,r) 


p  1,  2,  ...,  N 
l  ”■  1,  2 ^  ...,  m 
f  =  1,  2,  . . . ,  r 


where  x  is  the  score  vector  of  person  p  on  test  i  in  the  test  space 

and  V .  r  is  the  transformation  matrix  of  variable  i  in  function  f. 
if 


2.  The  group  centroid  in  function  f  is 


'•gf 


(l,r) 


x 

-•gi 

(l,m) 


hf 

(m,r) 


g  —  1,  2,  . .  .  ,  k 
i  =  1 ,  2 ,  ...»  m 
f  =  1,  2,  . . .  ,  r 


3.  The  total  dispersion  matrix  D  in  the  reduced  space  is 
D  =  V!.  .  T  .  V, 


V' 

if 

(r,r)  (r,m) 


if 

(m,m)  (m,r) 


l  —  1 ,  3,  ...,  m 
f  =  1,  2,  . . . ,  r 


4.  The  deviation  discriminant  score  vector  of  person  p  in 


group  g  is 


4>g 


yPl  '  y.gl*  yP2  -  y.g2> 


V  ‘  y-gf 


p  =  1,  2,  ...»  N 
f  =  1,  2,  . . . ,  r 


where  y  ^  is  the  discriminant  score  vector 
^pf 

y  gf is  the  group  centroid  vector  of  group  g 


of  person  p  in  function  f  and 
in  function  f.  As  many 
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deviation  discriminant  score  vectors  are  computed  as  there  are  groups. 

5.  The  position  of  the  discriminant  score  vector  of  each  person 
with  respect  to  a  particular  group  in  the  reduced  space  is  given  by  the 
quadratic  form 

X  ~  Zpg  •  D  •  Zpg  P  =  2»  •••»  N 

/ 1  \  /  \  r  =  number  of  functions 

(l,r)  (r , r)  (r,l) 

where  is  the  discriminant  score  deviation  vector  of  person  p  in  each 

group  and  D  is  the  total  or  common  dispersion  matrix  in  the  reduced  space. 
The  quadratic  is  distributed  as  chi  square  with  degrees  of  freedom  equal 
to  the  dimension  of  the  discriminant  space.  As  many  chi  square  values  are 
computed  as  there  are  deviation  discriminant  score  vectors  of  the  person 
in  k  groups. 

6.  The  corresponding  probability  or  centour  score  for  each  chi 
square  value  obtained  in  step  5  is  computed.  The  centour  score  may 

be  obtained  by  multiplying  the  obtained  probability  by  100. 

7.  The  conditional  classification  of  a  person  to  a  certain  group 
is  determined  by 

p(£i  I  H  )  =  p(x2  ^  X2)  i  =  1,  2,  ...,  N 

1  1  J  1  j  =  1,  2,  ...,  k 


where  p  is  the  probability  of  person  i  obtaining  a  score  vector 
given  that  he  is  a  member  of  the  j th  group. 

8.  The  probability  of  group  membership  is  also  computed  by  the 
Bayesian  theorem.  First  a.  priori  probabilities  of  i  groups  are  computed  as 


pj 


•  £  i  n . 

i=l  i 


where  n.  is  the  number  of  persons  in  group  i  based  on  this  study.  The 
proposed  sizes  of  enrollment  for  the  new  school  term  may  be  used  for 


this  purpose. 
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Birnhaum  and  Maxwell  in  Cronbach  and  Gleser  (1965)  explained  the 
Bayesian  probability  formula  as  follows: 


It  then  follows  formally  from  the  formula  of  Bayes  in  elementary 
theory  of  probability  that,  if  an  individual  is  randomly 
selected  from  this  hypothetical  composite  population  and  if  only 
his  measurement  u  is  observed,  then  the  probability  that  he  belongs 
to  category  number  i  is  given  by  p(i|u)  =  g^  p(uji)/p(u).  Here 

p(u)  is  the  probability  that  any  individual  randomly  selected  from 
the  hypothetical  composite  population  has  measurement  u,  that  is, 

k 

p(u)  =  Z  g^  p(u|i).  ...  In  this  hypothetical 

context,  Bayes’  formula  suggests  the  inference  that,  if  an  in¬ 
dividual  is  observed  to  have  measurement  u,  then  he  is  most  likely 
to  belong  to  that  category  i  for  which  the  quantity  g.  p(u|i)  is 
largest.  [pp.  240-241].  1 

is  the  a  priori  probability  that  membership  in  category  i  is  true. 

The  general  equation  used  in  this  study  was  one  cited  in  Cooley 


and  Lohnes  (1962)  as  follows: 


p(Hj|Xi)  = 


Z 

g 


g  —  1,  2,  ...,  k 
i  =  1,  2,  . .  .  ,  N 
j  =  2,  •  •  •  ,  g 


where  p  is  the  probability  that  person  i,  given  a  discriminant  score 
vector  y^,  is  a  member  of  the  jth  group.  This  equation  is  found  by 
substituting  in  Bayes'  probability  formula  the  density  function  of  y^  under 
the  multivariate  normality  assumption.  The  derivation  is  shown  by  Rulon, 
et  al.,  (1967,  pp.  340-342).  Bayesian  classification  of  group  membership 
was  determined  by  the  highest  probability  obtained. 

In  order  to  describe  the  extent  of  overlap  between  groups  in  the 
reduced  discriminant  space,  the  chi  squares  and  corresponding  probabili¬ 
ties  or  centours  of  the  centroids  in  each  group  were  computed  using 


steps  4,  5,  and  6  above.  The  group  centroids  in  this  case  were  treated 
as  individual  discriminant  scores. 

The  MULV12  program  was  a  classification  solution  using  the  sep¬ 
arate  dispersion  matrices  of  each  group  instead  of  the  common  dispersion. 
This  solution  was  carried  out  to  compare  the  results  of  classification 
with  the  MULV11  program  described  earlier.  The  calculation  routine  in 
this  program  was  similar  to  MULV11  except  that  the  separate  group  dis¬ 
persion  matrices  rather  than  the  total  or  common  dispersion  matrix  were 
used  in  obtaining  the  chi  squares  and  corresponding  probabilities  for  each 
person.  MULV11  and  MULV12  are  given  in  Appendices  III  &  IV,  pp.  151,  154. 

The  concept  of  centour  score.  The  word  centour  derives  from  the 
combination  of  "cent"  from  percent  and  "our"  from  contour.  The  percentage 
of  the  area  of  a  normal  distribution  beyond  the  contour  on  which  a  point 
lies  conveys  the  idea  of  centour.  (Bryan,  1951)  Suppose  the  mean  of 
a  univariate  normal  distribution  is  19  and  the  standard  deviation  is  3.72. 
An  individual  whose  raw  score  is  24  is  equal  to  a  deviation  score  of  +5 
and  a  standard  score  of  +1.344.  From  a  table  of  areas  of  normal  dis¬ 
tribution  the  proportion  which  exceeds  +1.344  is  .089  or  8.9%.  In  the 
centour  concept  a  deviation  score  of  +5  is  also  a  deviation  of  the  same 
magnitude  in  the  opposite  direction.  For  a  raw  score  of  24  the  centour 
score  is  equal  to  17.8.  This  is  the  sum  of  the  proportion  of  the  two 
tails  of  the  distribution  as  defined  by  the  standard  score  value.  A 
centour  score  of  17.8  means  that  17.8%  of  the  distribution  are  more  di¬ 
vergent  from  the  centroid  (which  in  this  case  is  the  mean  of  19)  than 
a  raw  score  of  24,  (Rulon,  et  al.  ,1967). 

In  a  multivariate  distribution  for  a  group,  the  centour  score 
indicates  the  percent  of  score  combination  which  is  probably  farther 


■ 
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away  in  all  directions  from  the  group  centroid.  A  centour  score  of  90 
indicates  that  the  score  combination  of  the  person  is  near  the  group 
centroid  while  a  centour  score  of  less  than  1  indicates  the  score  com¬ 
bination  of  the  person  is  very  far  away  from  the  group  centroid.  A 
centour  score  can  serve  as  an  index  for  determining  whether  an  individ¬ 
ual  resembles  certain  g  groups  as  defined  by  this  test  scores,  (Rulon, 
et  al  ,  1967)  . 

For  the  general  case  the  centour  score  is  determined  from  the 
value  of  chi  square  which  is  derived  from  the  exponent  of  the  equation 
of  a  normal  distribution. 


The  locus  of  points  of  equal  density,  sometimes  referred 
to  as  the  locus  of  equiprobability  or  iso-frequency,  in  a  normal 
distribution  satisfies  the  equation 


x  . 


g 


=  X 


2 

g 


with  x  being  a  row  vector  of  deviation  scores  for  any  number  of 
tests  T,  and  with  D  being  the^T  x  T  dispersion  matrix  for  the  T 
tests  ...  In  this  equation  x  is  an  arbitrary  constant.  The 

2  8 

larger  values  of  X  are  associated  with  loci  of  lesser  density. 
[Rulon,  et  al,  1967,  p.  191]. 

2 

The  fixed  value  of  X  may  be  determined  from  the  probab¬ 
ility  table  of  chi  squares.  The  table  gives  the  value  of 
2 

X  associated  with  the  argument  P,  the  probability  of  a  diver- 

2 

gence  greater  that  X  for  various  degrees  of  freedom.  [Rulon, 
et  al,  1967,  p.  118]. 

The  derivation  of  this  quadratic  is  shown  by  Rulon,  et  al.  (1967, 
pp.  115-117).  In  this  study  the  calculation  of  the  chi  squares  and 
corresponding  probabilities  were  included  in  the  computer  program  in¬ 
stead  of  referring  to  a  table  of  chi  squares. 

For  each  subject  in  the  validation  and  criterion  samples  the 


computer  output  the  results  of  the  classification  solutions  and  listed 
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the  person’s  identification  number,  the  vector  of  discriminant  scores, 
the  chi  square  values  and  corresponding  probabilities  of  membership 
in  each  group,  and  group  numbers  indicating  the  group  in  which  the  person 
was  classified.  The  computer  output  indicated  two  types  of  probabilities 
for  each  of  the  groups:  (a)  a  posteriori  probabilities  based  on 
Bayes’  theorem  on  a  given  a  priori  probabilities  and  (b)  conditional 
probabilities  (centours)  based  on  the  chi  square  distribution  for 
degrees  of  freedom  equal  to  the  dimensionality  of  the  reduced  discriminant 
space.  It  was  an  easy  matter  to  convert  these  conditional  probabilities 
to  centour  scores  by  multiplying  by  100  or  by  moving  the  decimal  point 
two  places  to  the  right.  However,  this  was  not  necessary.  Leaving 
the  decimal  point  as  shown  in  the  computer  print-out  did  not  in  any  way 
alter  the  concept  of  centour  interpretation. 

There  were  two  classification  numbers  shown.  One  of  these  numbers 
showed  the  group  to  which  the  subject  probably  belonged  if  classifica¬ 
tion  was  based  on  the  conditional  rule  (i.e.  his  highest  centour  score). 
The  other  number  was  for  classification  based  on  the  Bayesian  rule  (i.e. 
his  highest  posteriori  probability) .  The  output  format  in  these  com¬ 
puter  classification  programs  could  be  used  directly  by  a  vocational 
counselor  without  any  further  modification  or  additional  construction  of 
reference  tables.  (see  Chapter  VI,  p.  130  for  explanation) 

Since  the  interest  in  this  study  was  to  develop  a  procedure  for 
identifying  membership  of  persons  in  certain  training  groups  for  voca¬ 
tional  counseling  two  approaches  were  considered.  After  the  first  run 
of  all  the  statistical  analysis  described  above  the  results  were  studied 
to  find  the  extent  of  overlap  between  training  groups.  The  extent  of 


' 
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overlap  could  be  examined  from  the  centour  scores  of  the  group  centroids. 
The  larger  the  centour  scores  between  two  group  centroids  the  greater  was 
the  overlap  of  the  corresponding  ellipsiods  and  the  lesser  would  be  the 
effectiveness  of  classification.  This  was  also  seen  by  graphing  the  dis¬ 
criminant  vector  scores  of  the  group  centroid  in  the  reduced  discriminant 
space. 

One  of  the  approaches  was  to  carry  out  a  discriminant  analysis 
and  classification  solution  based  on  a  reduced  number  of  training  groups 
but  maintaining  the  same  19  predictors  and  find  out  how  well  this  would 
improve  the  classification  of  groups.  The  data  of  two  groups  which  had 
large  overlap  were  combined  as  one  group  in  this  analysis.  Of  course, 
some  information  was  lost  in  this  manner.  This  also  narrowed  down  the 
extent  of  alternatives  of  classification.  However,  this  drawback  could 
be  compensated  by  an  improvement  of  prediction. 

The  second  approach  was  to  reduce  the  number  of  predictors  by 
selecting  those  predictors  with  high  discriminant  weights  found  in  the 
first  analysis  and  then  carry  out  a  third  analysis  to  verify  whether  dif¬ 
ferentiation  of  groups  would  improve  or  decrease  compared  with  19 
predictors . 

Comparison  of  results  were  made  in  terms  of  the  number  of  correct 
classifications,  close  classifications,  fairly  close  classification, 
misclassif ications ,  and  corresponding  percentages.  The  decision  as  to 
which  analysis  and  classification  solution  was  desirable  was  judged  on 
its  utility  for  vocational  counseling. 

By  studying  the  positions  of  group  centroids  in  the  reduced 
space  as  illustrated  in  the  graphs,  psychological  interpretations  were 
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speculated  upon  regarding  the  meaning  of  the  distribution  of  scores  of 
the  criterion  sample; 


CHAPTER  V 


ANALYSIS  OF  DATA 

As  shown  in  Table  2,  out  of  461  enrollees  who  graduated  a 
stratified  random  sampling  of  90  cases  was  set  aside  for  later  validation 
The  remaining  371  cases  were  used  as  the  criterion  sample  in  the  analysis 
Four  hundred  three  non-graduate  subjects  shown  in  Table  3  were  later 
used  in  cross-classification  to  compare  the  extent  of  misclassif ication 
that  would  occur  if  prediction  weights  of  graduates  were  applied  on  these 
students.  The  first  analysis  involved  9  training  fields  and  19  predictor 
variables . 

A  one-way  analysis  of  variance  was  used  to  test  the  significance 
of  group  difference  of  means  and  homogeneity  of  variance  on  each  of 
the  19  predictors.  The  group  means  of  each  variable,  the  F-ratios,  the 
chi  square  values  and  the  corresponding  probabilities  are  shown  in  Table 
4.  All  observed  F-ratios  were  significant  at  .013-level.  Hypothesis 
(1)  of  no  difference  in  group  means  was  rejected.  There  were  significant 
differences  in  group  means  and  this  indicated  that  the  variables  could 
be  used  to  discriminate  groups  in  this  study.  The  chi  square  values 
were  significant  at  .91  >  p  >  .06  except  for  three  predictors,  DAT  NA, 

DAT  MR,  and  DAT  LU  I  (spelling)  which  were  significant  at  .002  >  p  >  .001 
The  data  did  not  contradict  the  hypothesis  of  homogeneity  of  variance 
except  with  three  variables  which  perhaps  required  a  larger  number  of 
subjects.  The  main  approach  of  this  study  was  multivariate.  Its  overall 
results  were  quite  different  from  a  study  of  variables  taken  singly.  The 


/ 


Table  4.  Analysis  1:  Means,  Univariate  F-tests, 
Chi  square  tests  of  homogeneity  of  vari 
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preliminary  univariate  tests  were  used  at  the  earlier  stage  of  this 
study  for  the  purpose  of  examining  the  predictive  qualities  of  the  var¬ 
iables  and  the  adequacy  of  sample  size.  At  this  stage  the  preliminary 
significance  tests  give  favourable  results. 

Another  preliminary  examination  of  the  predictive  utility  of  the 
measures  was  made  by  studying  the  inter correlations  of  the  19  variables. 

The  intercorrelations  and  population  means  of  the  criterion  sample  of 
graduates  are  shown  in  Table  5.  As  a  whole,  the  intercorrelations  were 
relatively  low  which  was  another  indication  of  the  favourable  possibilities 
of  these  variables  to  differentiate  various  training  groups  of  graduates. 
However,  out  of  the  171  pairs  of  correlations  26  pairs  had  correlations 
of  .50  to  .73.  Negative  correlations  varied  from  -.002  to  -.17  while 
positive  correlations  ranged  from  0  to  .73.  Some  correlations  of  .50 
or  over  were  found  between: 

1)  DAT  AR  and  DAT  SR,  Lorge-Thorndike  (nonverbal) ; 

2)  DAT  SR  and  Lorge-Thorndike  (nonverbal) ; 

3)  DAT  LU  II  (grammar)  and  DAT  VR,  DAT  LU  I  (spelling),  Lorge-Thorndike 

(verbal),  Read. -lit.  9  (Alberta),  Lang.  9  (Alberta); 

4)  DAT  VR  and  SCAT  3  (verbal) ; 

5)  Lorge-Thorndike  (verbal)  and  DAT  VR,  Read. -lit  9  (Alberta.),  SCAT 

3  (verbal); 

6)  Read. -lit.  9  (Alberta)  and  DAT  VR,  Lang.  9  (Alberta)  Soc.  Stu.  9 

(Alberta),  SCAT  3  (verbal); 

7)  Lang.  9  (Alberta)  and  DAT  LU  I  (spelling),  Soc.  Stu.  9  (Alberta), 

SCAT  3  (verbal).  Grade  X  average,  Grade  XI  average; 


. 


,  , 


Table  5.  Analysis  1:  Intercorrelation  and  criterion  means 
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8)  Science  9  (Alberta)  and  Soc.  Stu.  9  (Alberta),  Math.  9  (Alberta), 

Grade  X  average; 

9)  SCAT  3  (verbal)  and  SCAT  3  (quantitative) ; 

10)  Grade  X  and  Grade  XI  average. 

Considering  the  similarities  of  the  characteristics  measured  by  these 
variables  the  correlations  of  .50  or  over  were  quite  possible.  For 
example,  DAT  VR  and  Lorge-Thorndike  (verbal)  which  were  quite  similar 
measures  had  a  correlation  of  .73.  Between  Math.  9  (Alberta),  and 
Science  9  (Alberta)  the  correlation  was  .62.  DAT  NA  and  DAT  CSA  showed 
consistently  low  correlations  with  all  the  other  variables. 

This  study  hypothesized  that  the  scores  made  by  individuals  on 
the  nineteen  predictor  variables  were  related  to  their  membership  in 
occupational  groups,  that  is,  these  variables  could  be  used  to  deter¬ 
mine  different  regions  in  an  m  dimensional  test  space  which  certain 
groups  tended  to  occupy.  A  preliminary  verification  of  this  hypothesis 
was  made  by  testing  the  significance  of  the  difference  in  group  centroids 
by  using  the  F-ratio  approximation  developed  by  Rao  (1952).  The  re¬ 
sults  of  these  tests  are  shown  at  the  bottom  of  Table  6.  The  lambda 
obtained  was  .263  and  the  corresponding  F-ratio  of  3.33  was  significant 
at  the  .001- level.  It  was  concluded,  therefore,  that  the  group  centroids 
in  the  graduate  criterion  sample  were  not  the  same.  The  generalized 
multivariate  null  hypothesis  (2)  that  the  characteristics  of  the  various 
training  groups,  as  measured  by  the  nineteen  predictor  variables,  were 
the  same  was  not  tenable.  The  approximation  test  of  significance  of  the 
discriminating  power  of  the  variables  further  indicated  that  they  could 
possibly  be  effective  predictors  for  identifying  membership  in  certain 
training  groups  of  students  at  NAIT. 
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Analysis  _1  (9_  training  groups ,  19  predictors)  .  The  roots  of 
W  and  the  chi  square  tests  of  significance  of  dimensionality  of  the 
discriminant  space  are  shown  in  Table  6.  The  maximum  of  8  linear  roots 
were  obtained  in  this  analysis.  The  first  three  roots  were  significant 
at  .003-level.  From  these  results  the  first  three  discriminant  functions, 
whose  roots  were  highly  significant,  were  used  as  the  new  dimension  of 
the  reduced  space.  The  original  19-dimensional  test  space  was  then 
reduced  to  a  3-dimensional  discriminant  space.  The  corresponding  dis¬ 
criminant  weights,  as  shown  in  Table  7,  were  used  as  the  transformation 
matrix  in  converting  the  orginal  scores  to  discriminant  scores  for 
each  subject  of  the  criterion  and  validation  samples. 

The  standard  deviations  of  the  variables  for  each  of  the  nine 
training  groups  are  shown  in  Table  7.  Variations  in  standard  devia¬ 
tions  among  the  nine  training  groups  roughly  explained  the  high  or  low 
weighting  of  a  particular  variable.  For  example,  Soc.  Stu.  9  (Alberta) 
consistently  showed  low  weights  in  all  the  three  discriminant  functions. 
Standard  deveiations  of  this  variable  among  the  nine  training  groups 
also  showed  very  little  variations  in  values. 

The  standard  deviations  of  DAT  MR  varied  from  .4  to  .9.  Its 
weight  in  discriminant  function  I  was  high.  The  Grade  XI  average  had 
low  weights  in  discriminant  function  I  and  III  but  somewhat  high  weights 
in  function  II.  Standard  deviations  of  this  variable  were  similar  in 
some  training  groups  but  varied  by  . 2  in  other  groups.  The  variations 
in  standard  deviations,  in  a  general  way,  could  explain  the  high  or 
low  discriminant  weighting  on  a  particular  variable. 
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Table  6.  Analysis  1:  Roots,  Chi  square  test  of 
dimensionality  of  discriminant  space, 
and  multivariate  F-test 


Root 

X 

DF 

Chi  square 

P 

Percent 

1 

.9230 

152 

464.17 

.001 

55.59 

2 

.2998 

126 

236.61 

.001 

18.05 

3 

.1334 

102 

145.35 

.003 

8.03 

4 

.1139 

80 

101.77 

.051 

6.86 

5 

.0763 

60 

64.23 

.331 

4.59 

6 

.0658 

42 

38.65 

.619 

3.96 

7 

.0365 

26 

16.48 

.924 

2.19 

8 

.0121 

12 

4.01 

.983 

.73 

Total  100.00 


Lambda  =  .263 

DF  =  152;  DF2  =  2565 

F 

p  <.001 


3.33 


J 

Table  7.  Analysis  1:  Discriminant  weights  and 
standard  deviations  of  groups 
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The  variables  with  high  weights  in  discriminant  function  I  were 
DAT  LU  II  (grammar) ,  DAT  MR,  DAT  NA  while  those  with  very  low  weights 
were  Soc.  Stu.  9  (Alberta),  SCAT  3  (verbal),  and  Lang.  9  (Alberta). 

In  discriminant  function  II  high  weights  were  found  in  SCAT  3 
(quantitative),  DAT  AR,  Grade  X  average,  Math.  9  (Alberta),  and  DAT  VR 
while  those  containing  very  low  weights  were  Read. -lit.  9  (Alberta),  DAT 
NA,  and  Soc.  Stu.  9  (Alberta). 

In  discriminant  function  III,  DAT  SR,  SCAT  3  (quantitative),  DAT 
NA,  and  Lorge- Thorndike  (verbal)  were  weighted  high,  while  SCAT  3 
(verbal),  DAT  MR,  and  Read. -lit.  9  (Alberta)  were  weighted  low. 

The  group  mean  vectors  in  the  reduced  discriminant  space  are 
shown  in  Table  8.  The  nine  group  centroids  in  the  reduced  space  are 
graphically  illustrated  in  Figure  2.  From  this  figure,  centroids  1 
(control  system) ,  2  (industrial) ,  5  (plans  and  design) ,  and  9  (elec¬ 
trical-electronics)  were  quite  close  to  each  other.  The  ellipsoids  of 
these  three  groups  overlapped  to  some  extent.  Correct  classification 
of  persons  in  these  groups  would  not  be  as  efficient  as  those  in  groups 
3,  4,  6,  7,  and  8,  which  were  quite  apart. 

To  further  examine  the  extent  of  overlap,  the  centroid  centours 
for  each  training  group  are  shown  in  Table  9.  Centour  in  this  table  is 
based  on  the  assumption  that  there  is  constant  dispersion  within  each 
group.  For  example,  if  the  centroid  of  group  1  falls  on  the  X  percentile 
ellipse  of  the  multivariate  normal  distribution  then  we  shall  say  that 
group  1  centroid  is  an  X  centour  with  respect  to  group  2.  The  pairs  of 
centours  of  training  groups  1,  2,  5,  and  9  taken  two  at  a  time  were 
quite  high.  This  meant  that  their  centroids  were  close  to  each  other  and 
indicated  a  high  degree  of  overlap  of  their  isofrequency  ellipsoids. 
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Table  8.  Analysis  1:  Group  centroid  vectors  in 
discriminant  space 


Group 

No. 

Training  group 

I 

Discriminant 

II 

function 

III 

1 

Control  system 

-1.60 

-3.13 

2.37 

2 

Industrial 

-1.67 

-2.84 

2.22 

3 

Natural  resources 

-1.69 

-3.52 

1.87 

4 

Engineering 

-2.16 

-3.46 

1.90 

5 

Plans  and  design 

-1.81 

-3.20 

2.49 

6 

Laboratory 

-0.96 

-3.80 

2.23 

7 

Health 

-0.50 

-3.09 

2.17 

8 

Commercial 

-1.25 

-2.94 

1.90 

9 

Electric-electronics 

-2.02 

-3.26 

2.26 

Table  9.  Analysis  1:  Centours  of  group  centroids 
in  discriminant  space 


Group 

Training  groups 

Centroid 

1 

2 

3 

4  5  6 

7 

8 

9 

1 

100 

2 

95 

100 

3 

72 

58 

100 

4 

54 

48 

86 

100 

5 

97 

86 

65 

62 

100 

6 

40 

19 

48 

14 

28 

100 

7 

25 

19 

13 

02 

11 

49 

100 

8 

75 

82 

61 

29 

49 

37 

54 

100 

9 

88 

80 

79 

89 

95 

19 

05 

44 

70 
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Correct  classifications  of  persons  in  these  groups  were  expected  to  be 
few.  Predicted  classifications  were  determined  of  the  subjects  in  the 
criterion  and  validation  samples.  Two  kinds  of  classification  solutions 
were  carried  out.  One  of  these  solutions  computed  the  centour  scores, 
Bayesian  probabilities,  and  classifications  using  the  total  or  common 
dispersion  of  all  groups.  The  results  of  classification  of  the  criterion 
sample  using  this  solution  are  shown  in  Table  10. 

Under  the  conditional  rule  34%  of  the  subjects  in  the  criterion 
sample  were  correctly  classified.  If  the  counselor  would  consider  the 
second  highest  centour  as  a  close  classification  the  percentage  of 
prediction  would  increase  to  58%.  If  the  counselor  considered  the  third 
highest  centour  as  fairly  close  classification,  it  would  further  increase 
the  prediction  to  an  overall  result  of  71%  of  the  subjects  in  the  cri¬ 
terion  sample.  These  results  indicated  that  the  19  predictors  differ¬ 
entiated  the  9  training  groups  in  the  criterion  sample  fairly  well. 

Under  the  Bayesian  classification  rule  for  the  same  subjects  in 
the  criterion  sample,  there  was  no  correct  classification  in  training 
groups  1  and  2.  There  were,  however,  close  and  fairly  close  classifica¬ 
tion  in  these  two  groups.  The  number  of  correct  classifications  in  train¬ 
ing  groups  6,  7,  8,  and  9  were  far  higher  than  those  under  the  conditional 
rule.  The  Bayesian  rule  did  well  when  sizes  of  the  groups  were  large 
but  not  so  when  group  sizes  were  small.  Under  this  classification  rule, 
37%  of  the  subjects  in  the  criterion  sample  were  correctly  classified. 

If  the  results  of  the  close  and  fairly  close  classification  were  included, 
then  70%  might  be  considered  as  correctly  predicted.  This  result  was 
about  the  same  as  the  conditional  classification  but  again  was  higher 
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than  those  in  the  validation  samples  shown  in  Table  13.  The  small 
number  of  classifications  in  training  groups  1  and  2  is  likely  due  to 
the  large  overlap  of  these  groups  as  shown  in  the  graphical  plot  of 
the  group  centroids  in  Figure  2  and  due  to  the  small  sample  size  in 
these  groups. 

Centours ,  Bayesian  probabilities,  and  classification  were  also 
computed  on  the  same  subjects  in  the  criterion  sample  by  the  second  sol¬ 
ution,  that  is,  using  the  separate  dispersion  of  each  group  instead  of 
the  common  dispersion.  Predicted  correct  classifications  under  the 
conditional  rule  as  shown  in  Table  11  indicated  an  increase  in  groups 
2,  3,  6,  8,  and  9  but  a  decrease  in  groups  5  and  7.  There  was  an  in¬ 
crease  in  the  number  of  predicted  correct  classification  from  34%  to 
37% 

Under  the  Bayesian  rule,  groups  1  and  2  were  not  well  predicted. 
There  were  some  increases  in  the  number  of  correct  classifications  in 
groups  2,  3,  4,  6,  and  8  but  they  were  cancelled  out  by  the  decrease  in 
correct  classifications  in  groups  5,  7,  and  9.  The  predicted  correct 
classification  was  37%  of  the  subjects  in  the  criterion  sample.  As  a 
whole,  there  was  slight  improvement  in  prediction  when  the  separate  dis¬ 
persion  of  each  group,  instead  of  the  common  dispersion,  was  used  in 
computing  the  centours,  probabilities,  and  classification.  For  this 
sample  it  indicated  that  the  results  in  both  solutions  were  not  markedly 
different. 

Similarly,  the  two  classification  solutions  were  applied  to  the 
subjects  in  the  validation  sample.  The  results  of  predicted  classifica¬ 
tion,  using  the  common  dispersion  is  shown  in  Table  12.  None  of  the 
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Table  12.  Analysis  1:  Classification  of  validation 
sample  using  the  common  dispersion 
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individuals  in  groups  1  and  3  under  the  conditional  rule  and  groups  1, 

2,  and  3  under  the  Bayesian  rule  were  correctly  classified.  This  was 
likely  due  to  the  limited  size  of  check  samples  in  these  groups  and  to 
the  large  overlap  between  these  groups.  A  total  of  27%  to  28%  was  cor¬ 
rectly  classified  under  both  rules.  Considering  that  the  probability  of 
predicting  the  correct  group  to  which  a  person  belonged  was  1  in  9  or 
0.11  the  results  were  fairly  good.  The  overall  predicted  classification 
under  both  rules,  close  and  fairly  close  included,  was  57%  to  63%. 

The  classification  of  the  subjects  in  the  validation  sample  using 
the  separate  dispersion  of  each  group  is  shown  in  Table  13.  The  pattern 
of  results  was  not  so  different  compared  with  the  results  using  the  common 
dispersion.  Under  both  conditional  and  Bayesian  rules  the  results 
accounted  for  22%  to  26%  of  the  validation  samples  correctly  predicted. 

The  overall  predicted  classifications  were  60%  to  62%  correct.  In  these 
data  the  results  of  both  classification  solutions  were  not  markedly 
different. 

In  the  two  classification  rules,  it  was  noted  that  when  the  per¬ 
son  was  correctly  classified  by  the  conditional  rule  but  misclassif ied 
by  the  Bayesian  rule,  the  probability  under  the  Bayesian  rule  was  often 
a  close  one,  that  is,  if  one  considered  a  close  or  fairly  close  classifica¬ 
tion  as  a  correct  classification  then  both  the  conditional  and  the  Bayesian 
rules  were  performing  similarly  in  classifying  the  person.  Also,  when 
the  person  was  correctly  classified  by  the  Bayesian  rule  but  misclassif ied 
by  the  conditional  rule,  the  conditional  centour  value  was  often  close 
to  the  highest  centour.  In  other  words,  if  the  counselor  considered  the 
classifications  given  in  both  the  conditional  and  Bayesian  rules,  he 
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could  not  be  very  far  from  naming  the  training  group  to  which  a  person 
probably  belonged,  providing  the  scheme  was  predicting  sufficiently 
beyond  probability  expectation.  The  classification  results  showed  that 
the  nineteen  variables  could  discriminate  different  training  groups  of 
students  at  NAIT. 

Analysis  2  (8^  training  groups ,  19  predictors) .  As  illustrated  in 
Figure  2,  the  centroids  of  groups  1,  2,  and  5  were  quite  close  to  each 
other.  Furthermore,  as  shown  in  Table  9  these  groups  overlapped 
considerably.  The  approach  was  to  do  a  discriminant  analysis  and  class¬ 
ification  solution  based  on  8  training  groups,  that  is,  combining  groups 
1  and  2  and  leaving  the  remaining  groups  as  they  were.  The  nature  of  the 
training  and  aptitude  requirement  of  groups  1  (control  system)  and  2 
(industrial) ,  which  consisted  mostly  of  mechanical  skills  with  tools  and 
instruments  were  somewhat  similar  and  would  not  appreciably  change  the 
picture  of  the  a  priori  groupings.  This  procedure  would  probably  im¬ 
prove  the  number  and  accuracy  of  correct  classification.  The  breakdown 
of  criterion  and  validation  samples  in  this  analysis  is  shown  in  Table 
14.  Groups  1  and  2  combined  consisted  of  55  graduates.  In  this  second 
analysis  the  first  three  roots  were  found  to  be  significant  at  the  .002- 
level,  as  shown  in  Table  15.  Three  discriminant  functions  were  used  to 
determine  the  dimensionality  of  the  reduced  space.  The  F-ratio  approx¬ 
imation,  for  testing  the  difference  in  group  centroids,  was  3.56.  This 
was  significant  at  the  .001-level. 

Discriminant  weights  of  the  19  variables  are  shown  in  Table  16. 
Twelve  variables  showed  high  discriminant  weights.  They  were  the  DAT  VR, 
DAT  NA,  DAT  AR,  DAT  CSA,  DAT  MR,  DAT  SR,  DAT  LU  I,  DAT  LU  II,  Math.  9 
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Table  14.  Analysis  2:  Criterion  and  validation  samples  of 

graduates  classified  by  training  fields  and  technology 


Code 

No. 

Training 

field 

Technology  in  this 
classification 

Criterion 

sample 

Valida¬ 

tion 

sample 

Total 

12 

Control  sys- 
indus trial 

Electronic  data 
processing 

Gas 

Instrumentation 
Heavy  duty  equip. 
Indust,  production 
Office  machines 
Plastics 

55 

14 

69 

3 

Natural 

resources 

Forest 

Exploration 

34 

8 

42 

4 

Engineering 

Civil 

Materials 

Surveying 

35 

8 

43 

5 

Plans  and 
design 

Drafting 

Architectural 

Photography 

35 

9 

44 

6 

Laboratory 

Chemical 

Medical 

54 

13 

67 

7 

Health 

Dental  assisting 
Dental  laboratory 
Dietary 

Medical  X-ray 

46 

11 

57 

8 

Commercial 

Banking  &  finance 
Business  admin. 
Distributive 
Secretarial 

56 

14 

70 

9 

Electrical- 

electronics 

Electrical 

Electronics 

Telecommun. 

55 

14 

69 

Total 

370 

91 

461 
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Table  15.  Analysis  2:  Roots,  Chi  square  test  of 
dimensionality  of  discriminant  space, 
and  multivariate  F-test 


Root 

A 

DF 

Chi  square 

P 

Percent 

1 

.9252 

133 

434.14 

<  .001 

59.29 

2 

.2552 

108 

205.86 

<  .001 

16.35 

3 

.1357 

85 

126.64 

.002 

8.69 

4 

.0959 

64 

82.30 

.061 

6.14 

5 

.0644 

45 

50.40 

.268 

4.12 

6 

.0519 

28 

28.66 

.430 

3.32 

7 

.0327 

13 

11.02 

.609 

2.09 

Total  100.00 


Lambda  =  .288 

DF1  =  133;  DF2  =  2284 

F  =  3.56 

p  <  .001 
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Table  16. 


Analysis  2:  Discriminant  weights  of 
three  functions 


Discriminant  function 


Predictor  variables 

I 

II 

(normalized  weights) 

III 

1.  DAT  VR 

-.047 

-.226 

-.113 

2.  "  NA 

-.522 

-.009 

-.249 

3.  "  AR 

.158 

.257 

-.142 

4.  "  CSA 

i 

.174 

-.055 

-.202 

5.  "MR 

-.647 

-.195 

-.177 

6.  "  SR 

.123 

.199 

.291 

7.  "  LU 

I  (spell) 

.120 

.239 

-.211 

8.  "  LU 

II  (grammar) 

.354 

-.025 

.233 

9.  Lorge-Thorndike  (V) 

-.097 

-.162 

-.158 

10. 

"  (NV) 

-.157 

- .  066 

.108 

11.  Read. -lit.  9  (Alberta) 

.101 

-.013 

-.002 

12.  Lang.  9 

(Alberta) 

.015 

.143 

.022 

13.  Soc.  Stud.  9  (Alberta) 

.001 

.062 

-.077 

14.  Math.  9 

(Alberta) 

-.053 

-.374 

-.060 

15.  Science 

9  (Alberta) 

-.048 

.055 

.054 

16.  SCAT  3 

(V) 

.001 

-.170 

.071 

17.  SCAT  3 

(quantitative) 

.126 

.391 

.259 

18.  Grade  X 

average 

.125 

-.562 

.627 

19 .  Grade  XI  average 

.124 

-.221 

-.366 

' 


(Alberta),  SCAT  3  (quantitative),  Grade  X  average,  and  Grade  XI  average. 
Except  for  the  addition  of  DAT  CSA,  DAT  LU  I,  Grade  XI  average,  and  the 
exclusion  of  Lorge-Thorndike  (V) ,  these  variables  were  the  same  as  those 
found  with  high  weights  in  analysis  1.  Similar  observations  were  also 
noted  in  a  subsequent  analysis  3. 

From  the  centours  of  the  group  centroids  in  this  analysis,  as 
shown  in  Table  18,  the  large  overlaps  were  in  groups  1,  2,  3,  4,  5,  and 
9.  The  relative  location  of  the  group  centroids  in  the  reduced  space 
illustrated  in  Figure  3,  showed  that  groups  3  and  5  were  very  close  to 
each  other  if  viewed  on  the  plane  bounded  by  function  II  and  III.  The 
clustering  of  group  centroids  was  reduced  slightly. 

Results  of  classification  of  subjects  in  the  criterion  and  val¬ 
idation  samples  are  shown  in  Tables  19,  20,  21,  and  22.  Compared  with 
the  results  in  analysis  1  the  predicted  classifications  of  subjects  in 
the  validation  sample  increased  from  27%  to  33%  correct.  The  overall 
predicted  classifications  increased  from  60%  to  71%  correct.  However, 
groups  3  and  5  were  not  well  classified. 

The  classification  results  of  subjects  in  the  criterion  sample 
were  also  higher  than  in  analysis  1.  This  improvement  in  prediction  in 
this  second  analysis  was  likely  due  to  the  reduction  of  overlap  between 
groups  when  training  groups  1  and  2  were  combined  into  one  group. 

The  use  of  either  the  common  dispersion  or  the  separate  dispersion 
of  each  group  in  computing  the  classifications  gave  similar  results.  In 
the  later  classification  computation  done  in  this  study,  the  common  dis¬ 
persion  instead  of  the  separate  dispersion  of  each  group  was  used  to  min¬ 
imize  calculations.  It  was  demonstrated  in  the  second  analysis  that  the 


' 
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Table  17.  Analysis  2:  Group  centroid  vectors  in 
discriminant  space 


Group 

No. 

Training  Group 

Discriminant  function 

I  II  III 

12 

Control  sys-  industrial 

-1.32 

-2.44 

-2.32 

3 

Natural  resources 

-1.33 

-2.92 

-2.46 

4 

Engineering 

-1.79 

-2.88 

-2.34 

5 

Plans  and  design 

-1.44 

-2.55 

-2.13 

6 

Laboratory 

-0.55 

-3.19 

-2.24 

7 

Health 

-0.14 

-2.48 

-2.11 

8 

Commercial 

-0.90 

-2.34 

-2.40 

9 

Electrical- electronics 

-1.67 

-2.69 

-1.98 

Table  18.  Analysis  2:  Centours  of  group  centroids  in 
discriminant  space 


Group  _ Training  group 


Centroid 

12 

3 

4 

5 

6 

7 

8 

9 

12 

100 

3 

84 

100 

4 

73 

85 

100 

5 

96 

77 

80 

100 

6 

29 

49 

15 

27 

100 

7 

19 

12 

02 

14 

53 

100 

8 

89 

64 

32 

69 

42 

50 

100 

9 

74 

60 

.  83 

95 

16 

05 

35 

100 

84 


Figure  3-  Ana/ysis  2  :  l oca/ ion  o/  yroi/y>  centra ic/s 
m  a// s c riminan t  Sy>ace 
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Table  21.  Analysis  2:  Classification  of  validation 
sample  using  the  common  dispersion 
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Table  22.  Analysis  2:  Classification  of  validation 
sample  using  the  separate  dispersion 
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Note:  Numbers  in  parenthesis  are  number  of  classifications  based  on  random  assignment 
of  subjects  to  groups  in  proportion  to  the  total  number  of  subjects  in  each  gro 
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procedure  of  combining  groups  had  some  advantage  in  improving  the  ef¬ 
ficiency  of  classification,  although  it  could  not  be  overdone  at  the 
expense  of  reducing  the  number  of  groups  to  be  classified. 

Analysis  _3  (9  training  groups ,  10  predictors) .  From  the  results 
of  the  first  analysis  it  was  found  that  out  of  19  predictor  variables, 
ten  of  them  had  discriminant  weights  of  at  least  .30.  A  third  discrimin¬ 
ant  analysis  was  done  to  verify  how  well  these  highly  weighted  variables 
would  perform  in  discriminating  9  training  groups  of  NAIT  graduates.  Ten 
predictors  used  in  this  analysis  were  the  DAT  VR,  DAT  NA,  DAT  AR,  DAT  MR, 
DAT  SR,  DAT  LU  II,  Lorge-Thorndike  (verbal),  Math.  9  (Alberta),  SCAT  3 
(quantitative) ,  and  the  Grade  X  average. 

Tables  23,  24,  25,  and  26  show  the  results  of  analysis  3.  The 
first  three  roots  in  this  analysis  were  found  significant  at  the  .001- 
level.  The  observed  F-ratio  of  5.09  was  significant  at  the  .001-level. 
Again,  three  discriminant  functions  were  used  to  determine  the  dimen¬ 
sionality  of  the  reduced  space.  The  positions  of  high  discriminant 

weights  of  these  predictors  were  exactly  the  same  as  those  found  in 

> 

analysis  1  (refer  to  Table  7).  These  results  indicated  the  possibility 
of  using  a  lesser  number  of  predictors  than  the  19  variables  used  in 
analysis  1. 

The  results  of  the  classification  of  the  subjects  in  the  cri¬ 
terion  and  validation  samples  are  shown  in  Table  27  and  28.  Thirty- 
four  percent  to  35%  of  the  subject  in  the  criterion  sample  were  correctly 
classified  under  both  classification  rules.  The  overall  predicted 
classification,  close  and  fairly  close  classifications  included,  was  68% 


to  70%  correct. 


1  bnuo-1  ..  >iii  m  e  srU  yl^axa  ««*»*  I  *«<  *  e;  *'*“ 
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Table  23.  Analysis  3:  Roots,  Chi  square  test  of 
dimensionality  of  discriminant  space, 
and  multivariate  F-test 


Root 

X 

DF 

Chi  square 

P 

Percent 

1 

.7948 

80 

372.26 

<  .001 

61.02 

2 

.2499 

63 

166.09 

<  .001 

19.18 

3 

.1191 

48 

87.47 

<  .001 

9.14 

4 

.0634 

35 

47.82 

.073 

4.87 

5 

.0433 

24 

26.15 

.346 

3.33 

6 

.0243 

15 

11.20 

.739 

1.86 

7 

.0049 

8 

2.73 

.950 

0.37 

8 

.0029 

3 

1.02 

.797 

0.22 

Total  100.00 


Lambda  =  .348 

DF  =  80;  DF2  =  2247 

F  =  5.09 

p  <  .001 


> 
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Table  24. 


Analysis  3:  Centours  of  group  centroids 
in  discriminant  space 


Group 

No. 

Training 

Group 

Discriminant  function 

I  II  III 

1 

Control  system 

-2.54 

4.12 

2.58 

2 

Industrial 

-2.55 

3.80 

2.41 

3 

Natural  resources 

-2.57 

4.42 

1.96 

4 

Engineering 

-2.97 

4.43 

2.04 

5 

Plans  and  design 

-2 . 66 

4.20 

2.59 

6 

Laboratory 

-1.96 

4.74 

3.39 

7 

Health 

-1.58 

4.05 

2.29 

8 

Commercial 

-2.22 

3.87 

2.10 

9 

Electrical- electronics 

-2.72 

4.18 

2.43 

Table  25.  Analysis  3:  Group  centroid  vectors 
in  discriminant  space 


Group  _ Training  group 


centroid 

1 

2 

3 

4 

5 

6 

7 

8 

1 

100 

2 

94 

100 

3 

72 

64 

100 

4 

59 

51 

87 

100 

5 

99 

89 

73 

70 

100 

6 

44 

24 

49 

16 

37 

100 

7 

23 

22 

17 

03 

14 

56 

100 

8 

73 

85 

68 

33 

59 

43 

57 

100 

9 

97 

90 

83 

83 

99 

32 

11 

62 
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Table  26. 


Analysis  3:  Discriminant  weights 
of  three  functions 


Discriminant  function 

Predictor  variables  I  II  III 

(normalized  weights) 


1. 

DAT  VR 

-.044 

.375 

-.136 

2. 

"  NA 

-.543 

-.035 

-.535 

3. 

"  AR 

.079 

-.346 

.219 

4. 

"  MR 

-.624 

.211 

.150 

5. 

"  SR 

.091 

-.071 

.376 

6 . 

M  LU  II  (grammar) 

.462 

-.171 

.322 

7. 

Lorge-Thorndike  (V) 

-.043 

.178 

-.351 

8. 

Math.  9  (Alberta) 

-.079 

.294 

-.009 

9. 

SCAT  3  (quantitative) 

.168 

-.330 

.457 

10. 

Grade  X  average 

.223 

.657 

.218 

. 
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Table  28.  Analysis  3:  Classification  of  validation 
sample  using  the  common  dispersion 
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Predicted  classifications  of  the  subjects  in  the  validation 
sample  were  26%  to  32%  correct.  Groups  1  and  3  were  not  correctly 
classified  but  these  results  were  also  similar  to  analysis  1.  The 
overall  predicted  classifications  were  59%  to  63%  correct.  As  a  whole 
the  prediction  in  analysis  3  were  slightly  less  accurate  than  those 
in  analysis  1.  Generally  there  was  some  merit  in  considering  an 
analysis  involving  the  ten  highly  weighted  predictors  in  this  study 
instead  of  the  original  nineteen. 

Analysis  4_  (8^  training  groups ,  10  predictors) .  It  was  shown 
in  the  second  analysis  that  there  was  a  marked  increase  in  correct 
classifications  when  the  groups  were  reduced  by  combining  groups  1  and 
2,  thus  using  eight  groups  instead  of  the  original  nine.  A  fourth 
analysis  was  done  involving  the  10  highly  weighted  predictors  and  8 
training  groups.  This  would  further  verify  the  merits  of  the  two 
approaches  used. 

From  the  results  of  this  analysis,  as  shown  in  Tables  29,  30, 

31,  and  32,  the  first  three  roots  were  found  significant  at  the  .004- 
level.  For  testing  the  difference  in  group  centroids  the  F-ratio  ap¬ 
proximation  of  5.34  was  found  to  be  significant  at  the  .001-level. 

Three  discriminant  functions  were  used  to  determine  the  dimensionality 
of  the  reduced  space.  Again  the  high  weights  of  the  ten  predictors 
obtained  in  the  three  functions  were  similar  to  those  in  analysis  2. 

The  results  of  the  classification  of  subjects  in  the  criterion  and 
validation  samples  in  this  analysis  are  shown  in  Tables  33  and  34.  Both 
classification  rules  accounted  for  32%  to  34%  of  all  the  criterion  sam¬ 
ple  correctly  predicted.  These  results,  however,  were  lower  than  those 
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Table  29.  Analysis  4:  Roots,  Chi  square  test  of 
dimensionality  of  discriminant  space, 
and  multivariate  F-test 


Root 

X 

DF 

Chi  square 

P 

Percent 

1 

.8033 

70 

342.66 

<  .001 

66.39 

2 

.2097 

54 

134.52 

<  .001 

17.33 

3 

.1001 

40 

67.32 

.004 

8.27 

4 

.0453 

28 

33.65 

.213 

3.75 

5 

.0253 

18 

18.00 

.456 

2.09 

6 

.0213 

10 

9.17 

.516 

1.76 

7 

.0049 

4 

1.72 

.787 

.41 

Total  100.00 


Lambda  =  .379 

BF1  =  70;  DF2  =  2065 

F  =  5.34;  p  <  .001 
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Table  30.  Analysis  4:  Discriminant  weights  of  three  functions 


Discriminant  function 


Predictor  variables 

I 

II 

(normalized  weights) 

III 

1. 

DAT  VR 

.027 

.228 

-.136 

2. 

"  NA 

-.509 

.013 

-.473 

3. 

"  AR 

.081 

-.273 

.105 

4. 

"  MR 

-.657 

.270 

.093 

5. 

"  SR 

.111 

-.155 

.443 

6 . 

"  LU  II  (grammar) 

.436 

-.140 

.123 

7. 

Lorge- Thorndike  (V) 

-.067 

.204 

-.327 

8. 

Math.  9  (Alberta) 

-.065 

.371 

-.042 

9. 

SCAT  3  (quantitative) 

.158 

-.404 

.473 

10. 

Grade  X  average 

.255 

.643 

.439 

:  . 
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Table  31. 


Analysis  4:  Group  centroid  vectors  in 
discriminant  space 


Group  Training  Discriminant  function 

No.  Group  I  II  III 


12 

> 

Control  sys  -  industrial 

-2.39 

4.35 

2.08 

3 

Natural  resources 

-2.36 

4.81 

1.67 

4 

Engineering 

-2.76 

4.81 

1.85 

5 

Plans  and  design 

-2.44 

4.53 

2.34 

6 

Laboratory 

-1.71 

5.09 

2.11 

7 

Health 

-1.36 

4.36 

2.03 

8 

Commercial 

-2.02 

4.25 

1.83 

9 

Electrical- electronics 

-2.51 

4.53 

2.18 

Table  32.  Analysis  4:  Centours  of  group  centroids 
in  discriminant  space 


Group 

Centroid 

Training 

group 

12 

3 

4 

5 

6 

7 

12 

100 

3 

79 

100 

4 

74 

85 

100 

5 

96 

67 

73 

100 

6 

33 

45 

16 

34 

100 

7 

20 

15 

03 

14 

58 

100 

8 

85 

70 

37 

63 

47 

56 

9 

98 

79 

86 

99 

30 

12 

8 
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66 
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Table  34.  Analysis  4:  Classification  of  validation 
sample  using  the  common  dispersion 
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obtained  in  analysis  2.  The  overall  predicted  classifications  were 
72%  to  73%  correct. 

Predicted  classifications  of  subjects  in  the  validation  sample 
were  30%  to  34%  correct.  These  results  were  also  lower  than  analysis 
2.  The  overall  predicted  classifications  were  70%  to  74%  correct. 

Analysis  4,  involving  8  training  groups  and  10  predictors  produced  re¬ 
sults  which  were  slightly  lower  than  those  of  analysis  2,  which  involved 
8  training  groups  and  19  predictors.  If  a  situation  arose  where  the 
data  of  the  19  predictors  were  not  all  available,  the  vocational  counselor 
could  use  analysis  4  as  an  alternate  predicting  scheme  for  use  in 
counseling . 

Analysis  _5  (8^  training  groups ,  12  predictors)  .  At  the  Northern 
Alberta  Institute  of  Technology,  the  test  scores  of  students  in  the 
Differential  Aptitude  Tests,  the  Lorge-Thorndike  Intelligence  Tests, 
and  the  high  school  averages  in  Grades  X  and  XI  were  readily  available 
to  the  vocational  counselor.  As  mentioned  earlier,  the  vocational  guid¬ 
ance  and  counseling  services  of  NAIT  had  administered  these  tests  to 
new  students  who  were  applying  for  admission.  Noting  that  the  results 
of  discriminant  analysis  4  indicated  a  little  lower  results  compared 
with  analysis  2,  and  that  most  of  the  sub-tests  of  these  tests  stated 
above  were  those  used  in  analysis  4,  a  fifth  analysis  was  carried  out 
involving  8  training  groups  and  12  predictors  to  verify  the  feasibility 
of  an  alternate  predicting  scheme.  The  12  predictors  used  were  the 
eight  sub-scores  of  the  DAT,  the  two  sub-scores  of  the  Lorge-Thorndike, 
and  the  high  school  averages  in  Grades  X  and  XI.  These  variables  in¬ 
cluded  8  predictors  which  were  used  in  analysis  4.  The  same  training 
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groups  which  were  used  in  analysis  2  were  used  in  this  analysis. 

Results  of  analysis  5  are  shown  in  Tables  35,  36,  37,  and  38. 

Again  the  first  three  roots  were  found  significant  at  the  .001-level. 

Three  discriminant  functions  were  used  to  determine  the  dimensionality 
of  the  reduced  space.  Testing  the  difference  in  group  centroids,  the 
observed  F-ratio  of  4.91  was  found  to  be  significant  at  the  .001-level. 

Of  the  12  predictor  variables  used  in  analysis  5,  only  the  Lorge- Thorn¬ 
dike  sub-tests  had  low  discriminant  weights  in  all  three  functions. 

Results  of  the  classification  of  subjects  in  the  criterion  and 
validation  samples  are  shown  in  Tables  39  and  40.  For  subjects  in  the 
criterion  sample  both  classification  rules  accounted  for  35%  to  37% 
correct  prediction.  These  results  were  3%  higher  than  the  results  in 
analysis  4,  but  2%  lower  than  those  in  analysis  2.  The  overall  result 
of  predicted  classifications  was  74%  correct.  This  was  a  little  higher 
than  the  results  in  analysis  4  but  a  little  lower  than  those  in  analysis  2. 

Classifications  of  subjects  in  the  validation  sample  resulted  in 
34%  to  35%  correct  prediction.  These  results  were  2%  higher  than  those 
in  analysis  4  but  again  about  2%  lower  than  those  in  analysis  2.  The 
overall  result  of  predicted  classifications  was  64%  to  67%  correct.  This 
was  lower  than  the  results  obtained  in  analysis  2  and  4. 

As  a  whole  the  classification  results  in  the  fifth  analysis  which 
used  12  predictors  was  comparable  with  those  in  analysis  4  which  used 
10  predictors.  The  prediction  based  on  the  fifth  analysis  was  less 
accurate  than  those  based  on  analysis  2  which  used  19  predictors.  There 
was  some  advantage  in  using  more  predictors  in  this  study. 


,  • 
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Table  35.  Analysis  5:  Roots,  Chi  square  test  of 
dimensionality  of  discriminant  space, 
and  multivariate  F-test 


Root 

X 

DF 

Chi  square 

P 

Percent 

1 

.8865 

84 

377.02 

<  .001 

65.79 

2 

.1833 

66 

153.60 

<  .001 

13.60 

3 

.1087 

50 

94.36 

<  .001 

8.07 

4 

.0733 

36 

58.03 

.011 

5.44 

5 

.0390 

24 

33.12 

.101 

2.89 

6 

.0337 

14 

19.66 

.141 

2.50 

7 

.0231 

6 

8.01 

.237 

1.  71 

Total  100.00 


Lambda  =  .343 

DF1  =  84;  DF2  =  2158 

F  =  4.91;  p  =  <  .001 
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Table  36. 


Analysis  5:  Discriminant  weights  of  three 
functions 


Discriminant  function 


Predictor  variables 

I 

II 

(Normalized  weights) 

III 

1. 

DAT  VR 

.009 

.226 

.161 

2. 

"  NA 

-.515 

.108 

.208 

3. 

"  AR 

.199 

-.312 

.141 

4. 

"  CSA 

.196 

-.015 

.228 

5. 

"  MR 

-.660 

.221 

.331 

6 . 

"  SR 

.096 

-.109 

-.316 

7. 

"  LU  I  (spell) 

.133 

-.328 

.164 

8. 

"  LU  II  (grammar) 

.369 

.046 

-.246 

9. 

Lorge- Thorndike  (V) 

-.037 

.116 

.051 

10. 

"  "  (NV) 

-.167 

.105 

-.103 

11. 

Grade  X  average 

.096 

.796 

-.543 

12. 

Grade  XI  average 

.143 

.102 

.503 
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Table  37.  Analysis  5:  Group  centroid  vectors  in 
discriminant  space 


Group 

Training 

Discriminant  : 

function 

No. 

Group 

I 

II 

III 

12 

Control  sys  -  industrial 

-1.06 

2.31 

3.81 

3 

Natural  resources 

-1.01 

2.48 

3.81 

4 

Engineering 

-1.48 

2.62 

3.77 

5 

Plans  and  design 

-1.17 

2.37 

3.61 

6 

Laboratory 

-0.28 

2.84 

3.77 

7 

Health 

0.05 

2.35 

3.52 

8 

Commercial 

-0.66 

2.15 

3.76 

9 

Electrical-electronics 

-1.40 

2.56 

3.47 

Table 

38. 

Analysis  5 :  Centours  of 
discriminant  space 

group 

centroids 

in 

Group 

Training  groups 

Centroid 

12 

3 

4 

5 

6 

7 

8 

9 

12 

100 

3 

98 

100 

4 

79 

83 

100 

5 

95 

93 

86 

100 

6 

35 

50 

16 

28 

100 

7 

19 

22 

03 

16 

63 

100 

8 

88 

83 

36 

74 

49 

53 

100 

9 

72 

74 

90 

93 

16 

06 

38 

100 

* 

106 


d 

o 

•H 

U 

ID 

4->  d 
•H  O 
H  X 
O  CO 
H 

X  CD 
O  Cu 
CO 


d 

•H 

o 

T3 

•H 

4-1 

C 

CO 

O 

O 

s 

•H 

14-4 

0 

•H 

0 

CO 

CO 

CD 

CO 

X 

I — 1 

X 

<_> 

00 

c 

••  X 

m  co 
d 
CO 

•H  CO 
CO  <U 
to  X 


<C  co 


• 

ON 

CO 

<d 

H 

Xi 

cd 

H 


d 

O 

•H 

X 

nj 

o 

•H 

X 

•H 

CO 

CO 

co 

t— I 

CJ 

M-4 

o 

a) 

CM 

to 

H 


CD 

X 

d 

u 

d 

cO 

•H 

CO 

CD 

cO 

PQ 


CO 

CO 

•H 

£ 


cO 

x 

o 

H 


to 

X  (D 
H  CO 
X  O 
cO  t— I 
x  a 


CD 

co 

o 


4-1 

o 

<D 

U 

H 

o 

o 


co 

co 

X 

£ 


CO 

4-1 

o 

H 


to 

rH  CD 
H  CO 
•H  O 
CO  rH 
M-4  O 


<D 

CO 

o 


o 

CD 

u 

u 

o 

o 


d  CD 
•H  rH 

a 

o  § 
2  co 


GO 

d  CO 
•H  CX. 
d  3 
•H  O 
c0  H 
U  O 
H 


rH 

CN 

CN 

X 

00 

X 

X 

X 

X 

X 

iH 

CN 

rH 

rH 

X 

1 — 1 

ON 

X 

Of 

CN 

CO 

0 

X 

X 

or 

of 

of 

or 

of 

rH 

CN 

CN 

of 

of 

of 

of 

r-- 

r-'- 

X 

X 

X 

CN 

0 

X 

0 

X 

X 

X 

X 

1 — 1 

tH 

of 

X 

CM 

X 

CO 

O 

CN 

X 

0 

X 

X 

X 

rH 

rH 

tH 

tH 

X 

X 

ON 

X 

✓ — \ 

/^S 

/ — \ 

/-s 

CN 

rH 

CO 

X 

a> 

m 

X 

O  • 

O 

. 

00 

• 

O 

. 

ON 

• 

00 

• 

ON 

• 

X 

• 

r>* 

r-. 

CN  00 

CO 

co 

X 

CN 

r- 

CN 

m 

X 

00 

X 

CO 

X 

X 

V 

V-/ 

t 

V-/ 

S^/ 

X 

ON 

ON 

ON 

CN 

00 

X 

r- 

X 

X 

tH 

tH 

X 

X 

ON 

X 

X 

X 

X 

CO 

X 

X 

ON 

CO 

or 

of 

X 

CN 

CN 

CN 

Of 

or 

X 

CO 

r-'- 

X 

CN 

Of 

Of 

r"- 

0 

X 

X 

X 

X 

rH 

rH 

X 

X 

rH 

O 

of 

X 

ON 

r" 

of 

tH 

r-i 

rH 

X 

X 

00 

X 

/ — \ 

/o 

/— s 

/-N 

CN 

rH 

CO 

X 

m 

X 

CO  • 

of 

. 

X 

• 

X 

• 

X 

• 

ON 

• 

of 

• 

X 

. 

X 

X 

tH  00 

CO 

tH 

CO 

X 

CN 

r^. 

X 

in 

X 

00 

X 

00 

X 

X 

0/ 

' _ / 

V 

V-/ 

X 

X 

Of 

X 

X 

Of 

X 

X 

X 

0 

0 

x 

CO 

CO 

CO 

X 

or 

X 

X 

r-^ 

0 

X 

X 

cn 

•  tH 

0 

CO  CO 

CO 

00 

> 

1 

•H 

X 

Eo  X 

CD 

C 

X 

X 

a 

d 

CO  M 

0 

•H 

T3 

co 

co 

0 

X 

a) 

4-1 

H 

d 

d 

O 

•H 

P 

0 

u 

co 

0 

tH  CO 

rH 

d 

CD 

cO 

60 

4-» 

0 

•H 

X 

X 

M 

O  d 

CO 

0 

<D 

•H 

CO 

X 

H 

u 

0 

0 

<D 

H 

co 

d 

CO 

CO 

H 

4-4 

CD 

4-1 

(D 

H 

CH 

4-1  d 

d 

CD 

•H 

d 

CD 

O 

X 

g 

O 

X 

C  M 

4-» 

60 

CO 

nd 

X 

co 

B 

CD 

<D 

O 

CO 

d 

tH 

cO 

CD 

O 

X 

CJ 

Z 

W 

fH 

X 

EC 

CJ 

w 

CN 

rH 

CO 

of 

X 

X 

CO 

ON 

Note:  Numbers  in  parenthesis  are  number  of  classifications  based  on  random  assignment 

of  subjects  to  groups  in  proportion  to  the  total  number  of  subjects  in  each  group. 


107 


c 

o 

•H 

P 

c0 

X) 

•H 

C 

rH 

O 

CO 

•H 

> 

CO 

P 

CP 

CU 

O 

a 

co 

G 

•H 

O 

X) 

•H 

P 

G 

CO 

o 

CJ 

s 

•H 

§ 

4-) 

o 

•H 

o 

CO 

CO 

cu 

CO 

je 

rH 

p 

U 

60 

G 

•  • 

•H 

m 

CO 

G 

co 

•H 

CO 

CU 

rH 

G- 

CO 

s 

G 

cO 

< 

CO 

• 

O 

<r 

cu 

i — i 

rO 

CO 

H 

g 

o 

•H 

P 

CO 

O 

•H 

4-i 

•H 

CO 

CO 

cO 


4-1 

O 

CL) 

CU 


CU 

tH 

3 

P 

G 

cO 

•H 

CO 

0) 

SH 

CO 

OQ 


CO 

CO 

•H 

£ 


CO 

p 

o 

H 


to 

tH  CU 
l-l  CO 
•H  O 
CO  rH 
4-1  CJ 


a) 

CO 

o 


p 

o 

a) 

P 

P 

O 

CJ 


CO 

CO 

•H 

a 


cO 

P 

O 

H 


So 

rH  CU 
P  CO 
•H  O 
CO  tH 
>p  a 


cu 

co 

o 


a 

<u 

p 

p 

o 

o 


G  a) 

•H  rH 
CU 

o  i 

ss  cn 


00 

G 

•H 

G 

•H 

CO 


a. 

G 

o 

p 


P  o 
Eh 


r-~- 

tH 

CM 

m 

CM 

ID 

co 

o 

CO 

co 

CO 

o 

iH 

r--. 

I"". 

00 

a 

00 

tH 

rH 

rH 

tH 

vD 

iD 

i — 1 

rH 

o 

n- 

rH 

o 

CO 

O 

CO 

o- 

! — 1 

rH 

O’ 

o 

O’ 

O 

rH 

o- 

CM 

CM 

a 

rH 

t — i 

, — \ 

/'-'S 

/"N 

/^N 

N 

CM 

r^ 

a 

O' 

CO 

CM 

CM 

m  • 

o 

. 

co 

• 

O 

. 

VO 

• 

m 

• 

CO 

• 

a 

• 

rH 

o- 

CM 

o 

o 

o 

rH 

rH 

CM 

CM 

co 

CO 

'w' 

>w^ 

'w' 

v-/ 

v—' 

r^. 

rH 

CM 

m 

CM 

f"» 

co 

CO 

vD 

co 

CO 

CM 

r"- 

r^- 

oo 

a 

n. 

rH 

00 

O’ 

rH 

m 

vD 

i — i 

O 

o 

m 

o 

o 

CO 

CM 

tH 

CM 

rH 

tH 

CM 

iH 

CM 

o 

CM 

CM 

rH 

m 

m 

rH 

rH 

/ — V 

/«s 

/^v 

CM 

C''. 

a 

Os 

co 

CM 

CM 

O' 

tH 

• 

m 

• 

CM 

• 

co 

• 

1"- 

• 

CO 

• 

<r 

• 

CM 

m 

CM 

o 

o 

o 

rH 

rH 

CM 

CM 

CO 

CO 

V-/ 

' — / 

V 

v-t 

'w' 

s-/ 

- — s 

o- 

00 

00 

at 

co 

rH 

O’ 

o- 

tH 

o 

rH 

rH 

rH 

rH 

tH 

a 

o 

tH 

r\ 

CO 

•  iH 

a 

CO  CO 

co 

60 

l 

’H 

P 

So  *H 

<u 

C 

So 

tH 

rH 

G 

G 

CO  P 

o 

•H 

X) 

P 

CO 

CO 

O 

tH 

<U 

P 

p 

P 

G 

G 

o 

♦H 

O 

P 

CO 

a 

iH  CO 

rH 

G 

CU 

co 

60 

p 

O 

•H 

P 

P 

p 

O  G 

CO 

o 

CU 

•H 

CO 

rC 

P 

P 

CJ 

O 

cu 

P  X) 

p 

CO 

G 

co 

CO 

P 

P 

<U 

P 

<U 

EH 

(P 

P  G 

G 

cu 

•H 

G 

a) 

O 

tH 

s 

O 

1 — 1 

G  M 

P 

p 

60 

cO 

X) 

rO 

CO 

0 

CU 

<u 

O 

cO 

G 

tH 

CO 

CU 

o 

rH 

CJ 

Z 

W 

(P 

hJ 

33 

o 

W 

CM 

#1 

rH 

co 

O’ 

m 

00 

a 

Note:  Numbers  in  parenthesis  are  number  of  classifications  based  on  random  assignment 

of  subjects  to  groups  in  proportion  to  the  total  number  of  subjects  in  each  group. 


108 


Cross-classification  on  non-graduates 

The  prediction  scheme  developed  in  this  study  was  applied  on  a 
sample  of  non-graduates  to  verify  how  well  it  would  classify  a  different 
population  category.  As  shown  in  Table  3,  there  were  403  non-graduates 
who  were  not  included  in  the  previous  analyses.  Centours,  Bayesian 
probabilities,  and  classifications  were  computed  on  all  individuals  in 
this  sample.  The  prediction  weights  obtained  in  analysis  2  based  on  8 
training  groups  and  19  predictors  were  used  in  the  classification 
solution. 

Certainly  the  expectation  was  that  non-graduates  would  be  mis- 
classified.  However,  this  might  not  be  so  for  all  of  this  type  of 
student.  The  causes  of  students  withdrawing  from  school  or  transferring 
to  another  course  may  not  be  similar  for  all  students.  A  student  may 
withdraw  from  school  not  because  he  lacks  aptitude  for  the  training 
course  but,  perhaps  because  the  course  lacks  challenge.  Possibly  he 
could  have  graduated  had  he  stayed  and  finished  the  course. 

The  results  of  the  cross-classification  are  shown  in  Table  41. 
Twenty-two  percent  to  23%  of  the  subjects  in  the  non-graduate  sample 
were  correctly  classified  while  36%  to  39%  were  missed  entirely. 

These  results  were  opposite  in  trend  to  the  results  obtained  in  analysis 
2  (refer  to  Tables  19  and  21).  In  analysis  2  the  predicted  classifica¬ 
tions  of  graduates  were  33%  to  38%  correct  while  22%  to  26%  were 
misclassified. 

The  cross-classification  result  indicated  that  the  prediction 
scheme  developed  in  this  study  could  classify  about  half  of  the 
non-graduate  population  who  were  likely  to  lack  aptitude  and  abilities 
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for  technical  training.  However,  to  identify  non-graduates  as  such  was 
another  problem  which  was  dealt  with  in  a  later  analysis.  From  the 
verifications  carried  out,  the  results  indicated  that  the  prediction 
scheme  developed  here  could  be  a  useful  tool  for  vocational  counseling 
at  NAIT. 

Psychological  interpretations 

Eight  instead  of  nine  training  groups  in  discriminant  analysis  2, 
showed  a  fairly  good  number  of  correct  classifications.  The  results  of 
analysis  2  were  further  used  in  examining  how  the  scores  of  the  NAIT 
graduate  criterion  sample  were  distributed  in  the  three-dimensional 
discriminant  space.  The  location  of  the  8  group  centroids  in  this 
reduced  space  as  shown  in  Figure  3  was  plotted  as  projections  along 
each  discriminant  function  from  low  to  high  and  is  shown  in  Figure  4. 

The  purpose  was  to  derive  a  psychological  meaning  which  might  be 
attributed  to  each  of  the  functions.  Along  discriminant  function  I, 
the  group  centroids  were  found  to  arrange  from  low  to  high  as  follows: 

Low  -  Health 

-  Laboratory 

-  Commerical 

-  Industrial-control  systems 

-  Natural  resources 

-  Plans  and  design 

-  Electrical-electronics 
High  -  Engineering 

As  examination  of  the  arrangement  of  the  groups  indicated  that 
those  at  the  high  end  were  those  requiring  mechanical  skills  and  abil¬ 
ities.  Discriminant  function  I  may  be  labeled  as  mechanical  attribute. 
Health,  laboratory,  and  commercial  training  groups  probably  did  not  re¬ 
quire  as  much  of  the  mechanical  attribute  as  did  the  electrical  and 
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engineering  groups.  In  function  I,  the  DAT  LU  II (grammar) ,  DAT  CSA, 
and  DAT  AR  were  the  variables  weighted  highly  positive  while  DAT  MR 
and  DAT  NA  were  the  variables  weighted  highly  negative.  This  mechanical 
attribute  might  be  interpreted  in  terms  of  measures  of  mechanical  and 
numerical  abilities  (represented  by  the  DA  MR  and  DAT  NA  variables)  on 
one  extreme  end  of  the  graduate  population  and  measures  of  language, 
clerical,  and  abstract  reasoning  (represented  by  the  DAT  LU  II  -  grammar, 
DAT  CSA,  and  DAT  AR)  variables  on  the  other  extreme  end  of  the  population. 

In  discriminant  function  II  the  group  centroids  were  found  to 
range  from  low  to  high  as  follows: 

Low  -  Commercial 

-  Industrial-control  systems 

-  Health 

-  Plans  and  design 

-  Electrical-electronics 

-  Engineering 

-  Natural  resources 
High  -  Laboratory 

This  psychological  attribute  could  probably  be  labeled  as  in¬ 
tellectual.  Laboratory  and  engineering  groups  probably  require  higher 
intellectual  abilities  than  those  in  commercial  and  industrial-control 
systems  groups.  In  function  II,  the  SCAT  3  (quantitative)  and  DAT  AR 
were  weighted  highly  positive  while  Grade  XI  average,  Math.  9  (Alberta), 
and  DAT  VR  were  weighted  highly  negative.  The  intellectual  attribute 
might  be  interpreted  in  terms  of  measures  of  general  intelligence 
(represented  by  SCAT  3  and  DAT  AR)  variables  on  one  extreme  end  of  the 
population  and  measures  of  scholastic  achievement  (represented  by  Grade 
XI  average.  Math.  9  (Alberta),  and  DAT  VR)  variables  on  the  other  extreme 


end  of  the  population. 
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In  discriminant  function  III,  the  group  centroids  were  found  to 
range  from  low  to  high  as  follows: 

Low  -  Electrical-electronics 

-  Health 

-  Plans  and  design 

-  Laboratory 

-  Industrial-control  systems 

-  Engineering 

-  Commercial 

High  -  Natural  resources 

This  psychological  attribute  could  probably  be  labeled  as  verbal- 
enterprising.  The  electrical-electronics  and  plans  and  design  groups 
probably  require  less  of  the  verbal-enterprising  attribute  than  those  in 
the  commercial  and  natural  resources  groups.  In  function  III,  Grade  X 
average,  DAT  SR,  SCAT  3  (quantitative)  were  weighted  highly  positive 
while  Grade  XI  and  DAT  NA  were  weighted  highly  negative.  The  verbal- 
enterprising  attribute  might  be  interpreted  in  terms  of  measures  of 
scholastic  achievement,  general  intelligence,  and  imagination  (repre¬ 
sented  by  Grade  X  average,  DAT  SR,  SCAT  3  -  quantitative)  variables  versus 
measures  of  numerical  ability  and  scholastic  achievement  (represented 
by  Grade  XI  average  and  DAT  NA)  variables. 

Although  the  predictor  variables  used  were  limited  to  areas  of 
aptitudes,  intelligence,  and  scholastic  achievement,  it  was  interesting 
to  note  how  discriminant  analysis  in  this  study  could  distinguish  dim¬ 
ensions  of  group  attributes  with  some  degree  of  precision.  The  differ- 
entation  of  attributes  shown  in  this  study  suggested  that  the  use  of 
more  predictors  which  measures  various  psychological  characteristics 
would  probably  achieve  more  distinct  identification  of  groups  than  the 


measures  used  here. 
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Analysl s  j6  (_2  outcome  categories ,  19  predictors)  .  This  analysis 
investigated  whether  the  19  predictors  used  in  this  study  could  different¬ 
iate  between  those  who  would  be  expected  to  graduate  and  those  who  would 
not.  The  criterion  samples  were  323  graduates  and  282  non-graduates. 

The  results  of  the  analysis  of  variance  for  testing  the  significance 
of  group  means  and  homogeneity  of  variance  of  the  19  predictors  are  shown 
in  Table  42.  Group  means  in  ten  predictors  were  not  significant  at  the  .05- 
level  while  nine  were  significant  at  the  .002-level.  Hypothesis  (1)  of 
no  difference  in  group  means  was  not  entirely  rejected.  This  preliminary 
analysis  showed  that  10  of  the  variables  would  not  differentiate  effective¬ 
ly  between  graduates  and  non-graduates. 

The  chi  square  values  of  12  predictors  were  significant  at  .06  > 
p  >  .87  except  for  7  predictors  which  were  significant  at  .04  >  p  >  .001. 

The  hypothesis  of  homogeneity  of  variance  was  not  entirely  tenable.  These 
results  indicated  the  possibility  of  large  overlap  between  groups  and 
thereby  increased  misclassif ication  of  persons. 

The  intercorrelations  of  the  19  variables  were  also  computed  and 
the  results  are  shown  in  Table  43.  Out  of  171  pairs  of  correlations  28 
pairs  obtained  correlations  over  .50.  These  pairs  were: 

1)  DAT  SR  and  DAT  AR,  DAR  MR,  Lorge-Thorndike  (NV)  ; 

2)  Lorge-Thorndike  (V)  and  DAT  VR,  LU  II  (grammar).  Read. -lit.  9  (Alberta), 
SCAT  3  (V) ; 

3)  Lorge-Thorndike  (NV)  and  DAT  AR; 

4)  Read. -lit.  9  (Alberta)  and  DAT  VR,  LU  II  (grammar).  Science  9  (Alberta), 
SCAT  3  (V) ; 

5)  Lang.  9  (Alberta)  and  LU  II  (grammar),  SCAT  3  (V),  Grade  X  average, 


. 
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Table  42. 


Analysis  6:  Means,  univariate  F-tests  and 
Chi  square  tests  of  homogeneity  of  variance 


Predictors 

Group  means*3 

1  2 

Fa 

P 

Chi 

Sq . 

P 

1 

DAT  VR 

3.71 

3.62 

2.72 

.099 

1.75 

.186 

2 

"  NA 

3.48 

3.39 

6.99 

.008 

12.70 

<.001 

3 

"  AR 

3.96 

3.94 

.08 

.783 

10.37 

.001 

4 

"  CSA 

5.43 

5.29 

3.29 

.070 

1.14 

.285 

5 

"  MR 

5.44 

5.35 

1.84 

.175 

3.49 

.062 

6 

"  SR 

3.90 

3.87 

.11 

.738 

.05 

.816 

7 

"  LU  I  (spell.) 

8.40 

8.43 

.19 

.661 

4.80 

.028 

8 

"  LU  II  (gram.) 

3.78 

3.73 

.60 

.440 

.96 

.327 

9 

Lorge-Thorn.  (V) 

6.15 

6.01 

3.77 

.053 

.03 

.868 

10 

"  "  (NV) 

5.08 

4.99 

1.55 

.213 

7.19 

.007 

11 

Read. -lit. 9  (Alta.) 

6.12 

5.81 

9.77 

.002 

.72 

.396 

12 

Lang.  9  (Alta.) 

6.17 

5.80 

12.71 

<.001 

.13 

.716 

13 

Soc.  stu.  9  (Alta.) 

6.56 

6.21 

13.54 

<.001 

4.26 

.039 

14 

Math.  9  (Alta.) 

6.58 

6.18 

14.86 

<.001 

4.44 

.035 

15 

Science  9  (Alta.) 

6.73 

6.33 

17.40 

<.001 

8.35 

.004 

16 

SCAT  3  (V) 

4.18 

4.06 

2.35 

.125 

.71 

.396 

17 

SCAT  3  (quant.) 

3.78 

3.62 

6.67 

.010 

.04 

.837 

18 

Grade  X  average 

3.53 

3.28 

33.78 

<.001 

.13 

.712 

19 

Grade  XI  average 

3.34 

3.06 

34.91 

<.001 

.63 

.427 

aDF  =  1  ;  DF  =  603 

1  -  Graduate  sample 

2  -  Non-graduate  sample 


Table  43.  Analysis  6:  Intercorrelation  and  criterion  means 
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Grade  X  average; 

7)  Math.  9  (Alberta)  and  DAT  NA,  Soc.  stud.  9  (Alberta) ,  Science  9 
(Alberta),  SCAT  3  (NV) ,  Grade  X  average; 

8)  SCAT  3  (V)  and  DAT  VR; 

9)  Grade  X  average  and  Science  9  (Alberta) ,  Grade  XI  average. 

As  a  whole  the  intercorrelations  were  low.  Positive  correlations  varied 
from  .002  to  .71  while  negative  correlations  ranged  from  -.004  to  -.16. 

The  high  correlations  of  these  pairs  were  likely  due  to  the  similarity  of 
the  characteristics  being  measured  by  these  variables.  This  also  in¬ 
dicated  that  some  variables  were  duplicated.  The  intercorrelations  showed 
that  the  predictors  might  differentiate  two  outcome  categories  of  NAIT 
students . 

The  single  root  obtained  in  the  discriminant  analysis  as  shown  in 
Table  44  was  0.11  which  was  much  lower  than  the  significant  roots  obtained 
in  the  previous  analyses  involving  9  and  8  training  groups.  The  value 
indicated  that  the  variance  along  the  discriminant  axis  was  small  and 
might  not  provide  sufficient  differentiation  between  two  groups. 

However,  in  testing  the  difference  in  group  centroids  the 
observed  F-ratio  of  3.39  was  significant  at  the  . 001-level.  The  generalized 
multivariate  null  hypothesis  (2) ,  that  the  characteristics  of  the  two 
outcome  categories  as  measured  by  the  19  predictors  were  the  same,  was 
rejected.  There  was  difference  between  the  two  group  centroids  and  this 
indicated  that  differentiation  between  the  two  categories  might  be 
possible. 

The  value  of  the  group  centroids  of  the  graduate  and  non-graduate 
samples  along  the  discriminant  axis  were  3.36  and  2.96  respectively.  The 


. 
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Table  44.  Analysis  6:  Root,  Chi  square  test  of 
dimensionality  of  discriminant  space, 
multivariate  F-test,  and  discriminant 
weights  in  one  function 


Root 

X 

DF 

Chi  sq 

£ 

1 

.1110 

19 

61.95 

<  .001 

Wilks  lambda 
.900 

19 

f? 

F 

3.39 

£ 

<  .001 

Predictors 

Discrim,  function  I 

Standard 

deviation 

(normalized  weights) 

Graduates 

Non-grads . 

1. 

DAT  VR 

-.055 

.649 

.701 

2. 

"  NA 

.144 

.378 

.465 

3. 

"  AR 

-.345 

.504 

.  606 

4. 

"  CSA 

.112 

.893 

.950 

5. 

"  MR 

.358 

.750 

.835 

6 . 

"  SR 

-.131 

1.053 

1.067 

7. 

"  LU  I  (spell.) 

-.270 

1.055 

.929 

8. 

"  LU  II  (grammar) 

-.202 

.771 

.728 

9. 

Lorge-Thorndike  (V) 

-.001 

.887 

.896 

10. 

"  "  (NV) 

.118 

.789 

.921 

11. 

Read. -lit.  9  (Alberta) 

.175 

1.144 

1.201 

12. 

Lang.  9  (Alberta) 

.090 

1.277 

1.304 

13. 

Soc.  stud.  9  (Alberta) 

-.047 

1.109 

1.250 

14. 

Math.  9  (Alberta) 

.018 

1.193 

1.347 

15. 

Science  9  (Alberta) 

.019 

1.087 

1.285 

16 . 

SCAT  3  (V) 

-.090 

.998 

.950 

17. 

SCAT  3  (quantitative) 

.038 

.729 

.738 

18. 

Grade  X  average 

.467 

.526 

.537 

19. 

Grade  XI  average 

.549 

.582 

.556 

;•* 
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variables  which  obtained  high  weights  were  Grade  X  average.  Grade  XI  aver¬ 
age,  DAT  MR,  DAT  AR,  DAT  LU  I  (spell.)  and  DAT  LU  I  (grammar).  These 
group  centroid  values  might  be  interpreted  as  technical  training  success 
continuum  along  the  discriminant  axis  as  measured  by  the  predictors  and 
represented  by  the  variables  above. 

The  centour  score  of  the  graduate  group  with  respect  to  the  non¬ 
graduate  group  shown  in  Table  45  was  51.  This  meant  that  the  graduate 
group  overlapped  with  the  non-graduate  group  by  51%  and  that  about  half 
of  both  categories  would  likely  be  misclassif ied . 

Discriminant  scores,  centours ,  Bayesian  probabilities,  and  class¬ 
ifications  using  the  common  dispersion  were  computed  for  259  subjects  in 
the  validation  sample.  The  results  of  the  classifications  are  summarized 
in  Table  46.  Of  the  138  graduates  in  the  validation  sample  both  classific¬ 
ation  rules  correctly  predicted  50%  to  59%  as  graduates  and  41%  to  59%  as 
non-graduates.  Of  the  212  non-graduates  in  the  validation  sample,  both 
classification  rules  correctly  predicted  55%  to  66%  as  non-graduates 
and  34%  to  45%  as  graduates.  The  overall  predicted  classifications  were 
57%  to  58%  correct  and  42%  to  43%  misclassified.  The  low  percentage  of 
correct  classifications  showed  that  it  was  not  possible  to  differentiate 
between  graduates  and  non-graduates  with  sufficient  efficiency  in  this 
analysis . 

Before  the  computations  of  discriminant  scores,  centours,  Bayesian 
probabilities,  and  classifications  were  carried  out,  the  preliminary 
univariate  F-tests  and  homogeneity  of  variance  tests,  the  low  value  of  the 
root  and  the  large  overlap  obtained  already  preclude  the  difficulties  of 
obtaining  adequate  differentiation  between  groups.  Perhaps  the  reason  for 
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Table  45.  Analysis  6:  Group  centroids  and  centours 
in  discriminant  space 


Group  Category 


Graduates 

Non-graduates 


Discrminant  function  I 


Group  centroid 


3.36 

2.95 


Group  centroid 

Centour 

of 

Graduates 

Non-graduates 

Graduates 

100 

51 

Non-graduates 

51 

100 

Table  46.  Analysis  6:  Classification  of  validation 

sample  of  graduates  and  non-graduates  using 
the  common  dispersion 


Type  of  Classification 


Group  category 

No.  of 

Conditional 

Bayesian 

Samples 

correct 

miss 

correct 

miss 

Graduates 

138 

69 

69 

82 

56 

(50%) 

(50%) 

(59%) 

(41%) 

Non-graduates 

121 

80 

41 

66 

55 

(66%) 

(34%) 

(55%) 

(45%) 

Total 

259 

149 

110 

148 

111 

Percent 

100% 

58% 

42% 

57% 

43% 
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these  difficulties  was  that  the  population  samples  used  were  already 
preselected  by  the  usual  admission  procedures  of  the  institute  and  were, 
therefore,  a  restricted  group.  Another  reason  was  that  the  causes  of 
non-graduation  are  not  the  same  for  all  students.  A  student  may  not 
graduate  not  because  he  lacks  aptitude  and  ability  for  the  training  but 
perhaps  because  the  course  lacks  challenge  or  because  he  found  a  job. 

He  could  have  graduated  had  he  stayed  and  finished  the  course. 

Possibly  the  admission  procedures  at  NAIT  were  performing  a  sat¬ 
isfactory  job  of  distinguishing  graduates  from  non-graduates.  However, 
the  results  of  analysis  of  8  training  groups  and  19  predictors  in  this 
study  would  likely  be  helpful  in  counseling  the  student’s  choice  of 
training  course  to  pursue  and  supplement  the  selection  procedures  of  the 
institute . 

Summary 

This  study  investigated  the  possibility  of  developing  a  procedure 
for  predicting  training  outcomes  of  graduates  at  NAIT  by  using  19  predictor 
variables.  As  a  preliminary  step,  a  one-way  analysis  of  variance  was 
done  to  ascertain  the  usefulness  of  each  variable  as  a  predictor.  For 
each  variable,  the  F-tests  of  the  difference  of  group  means  for  the  9 
training  groups  of  the  criterion  sample  all  were  found  significant  at  the 
.013-level.  The  results  showed  that  each  training  group  was  different 
from  the  other,  as  measured  by  the  variables. 

Another  preliminary  verification  was  made  by  examining  the  inter¬ 
correlations  of  the  19  variables.  It  was  found  that  the  intercorrelations 


were  generally  low. 
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The  F-ratio  approximations,  to  test  the  difference  of  group 
centroids  in  the  test  space  were  found  significant  at  the  .001-level  in 
all  discriminant  analyses  done  in  this  study. 

Five  discriminant  analyses  and  classification  solutions  were 
carried  out  on  the  graduate  sample.  Analysis  1  was  based  on  9  training 
groups  and  19  predictors.  Correct  classifications  in  each  training  group 
of  the  criterion  sample  accounted  for  36%  correct  prediction.  The  over¬ 
all  predicted  classification  was  73%  correct.  Correct  classifications 
in  each  training  group  for  the  validation  sample  accounted  for  25%  correct 
prediction.  The  overall  predicted  classification  was  61%  correct.  The 
classifications  were  computed  using  the  total  or  common  dispersion  as 
well  as  using  the  separate  dispersion  in  each  group  and  the  results  in 
both  solutions  were  found  comparatively  similar. 

Analysis  2  was  done,  involving  8  training  groups  and  19  predictors, 
to  examine  the  possibility  of  improving  the  number  of  correct  classifica¬ 
tions.  The  9  training  groups  were  reduced  to  8  groups  by  combining  groups 
1  and  2,  which  were  found  in  the  first  analysis  to  overlap  to  a  large 
extent.  Correct  classification  for  the  criterion  sample  increased  to 
38%  correct  prediction.  The  overall  predicted  classifications  increased 
to  76%  correct.  For  the  validation  sample  the  number  of  correct  class¬ 
ifications  increased  to  33%  correct  prediction.  The  overall  predicted 
classifications  also  increased  to  72%  correct.  The  procedure  of  com¬ 
bining  similar  groups  which  overlapped  considerably  was  found  to  improve 
correct  classification.  Again  classifications  computed  using  the  separate 
dispersion  in  each  group  were  found  similar  to  the  results  computed  using 
the  common  dispersion  of  all  groups.  In  subsequent  analyses  the  classific¬ 
ation  solution  was  computed  using  the  common  dispersion. 
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Analysis  3  was  carried  out,  involving  9  training  groups  and  10 
predictors  which  were  found  highly  weighted  in  the  first  analysis,  to 
verify  the  results  of  classification  when  the  number  of  predictors  were 
reduced.  Correct  classifications  for  the  criterion  samples  accounted 
for  34%  correct  prediction.  The  overall  predicted  classifications 
were  69%  correct.  Correct  classifications  of  the  validation  samples 
accounted  for  29%  correct  prediction.  The  overall  predicted  classifica¬ 
tions  were  65%  correct.  As  a  whole  these  results  were  slightly  lower 
than  analysis  1. 

Analysis  4  was  done  involving  8  training  groups  and  10  highly 
weighted  predictors  to  verify  the  results  of  classification  when  groups 
1  and  2  were  combined  into  one  group.  Predicted  classifications  of  the 
criterion  samples  accounted  for  33%  correct.  The  overall  predicted 
classifications  increased  to  72%  correct.  In  the  validation  samples 
the  number  of  correct  classifications  increased  to  32%  correct  pre¬ 
diction.  The  overall  predicted  classifications  were  72%  correct.  The 
results  of  this  fourth  analysis  indicated  the  possibility  of  using  a 
fewer  number  of  predictors  than  19  although  prediction  would  probably 
be  less  accurate. 

At  NAIT,  the  scores  of  the  DAT,  Lorge-Thorndike  Intelligence  Tests, 
and  the  high  school  averages  in  Grades  X  and  XI  were  readily  available 
to  the  vocational  counselor.  In  the  previous  analyses,  almost  all  of 
these  variables  were  found  highly  weighted.  Analysis  5  was  carried  out 
involving  8  training  groups  and  the  12  predictors  mentioned  above.  Cor¬ 
rect  prediction  of  the  criterion  sample  accounted  for  36%  correct.  The 
overall  predicted  classifications  were  74%  correct. 
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In  the  validation  sample,  the  number  of  correct  classifications 
accounted  for  34%  correct  prediction.  The  overall  predicted  classifica¬ 
tions  were  65%  correct.  The  results  of  the  fifth  analysis  showed  that 
using  8  groups  and  12  variables  performed  fairly  well  as  compared  with 
analysis  4,  but  somewhat  lower  compared  with  analysis  2. 

The  prediction  weights  in  analysis  2  were  used  on  subjects  in 
the  non-graduate  sample  to  verify  how  far  it  would  classify  a  different 
population  category.  The  results  were  just  the  opposite  to  those  ob¬ 
tained  in  analysis  2.  The  percentages  were  similar  but  in  reverse  order. 
The  cross-classification  results  further  showed  that  the  prediction  scheme 
developed  in  this  study  could  be  a  useful  tool  for  vocational  counseling 
at  NAIT.  An  attempt  was  made  to  interpret  the  meaning  of  the  distri¬ 
bution  of  scores  of  the  NAIT  graduate  criterion  sample  in  the  three- 
dimensional  discriminant  space.  The  first  discriminant  function  was 
interpreted  as  measuring  mechanical  characteristics  of  the  students, 
while  the  second  function  was  measuring  intellectual  characteristics, 
and  the  third  function  as  measuring  verbal  enterprising  characteristics. 

As  a  whole  the  results  showed  that  the  19  predictor  variables 
could  possibly  be  used  in  predicting  training  outcomes  of  graduates  at 
NAIT  with  fairly  good  results.  Twelve  highly  weighted  variables  in  this 
study  could  probably  perform  as  well  as  the  19  predictors  but  prediction 
efficiency  might  be  a  little  less. 

A  sixth  analysis  was  carried  out  involving  two  outcome  categories 
and  19  predictors  to  investigate  whether  it  was  possible  to  differentiate 
adequately  between  graduates  and  non-graduates.  The  results  showed  that 
this  analysis  correctly  predicted  58%  and  misclassif ied  42%  of  the  valid¬ 
ation  sample.  It  was  concluded  that  this  analysis  was  not  efficient 
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enough  to  differentiate  between  graduates  and  non-graduates.  One 
given  was  that  the  samples  used  were  already  preselected  and  were 
fore,  a  restricted  group. 
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CHAPTER  VI 


DISCUSSION 

Summary 

In  this  study,  nineteen  predictor  variables  were  used  to  develop 
a  procedure  for  predicting  training  outcomes  of  graduates  of  the 
Northern  Alberta  Institute  of  Technology.  The  problem  was  to  determine 
whether  a  combination  of  19  predictor  variables  could  discriminate  9 
training  groups  of  graduates.  Furthermore,  the  study  investigated 
various  modifications  which  might  improve  the  effectiveness  of  classifica¬ 
tion  of  persons  to  different  training  groups. 

The  discriminant  analysis  was  used  to  determine  the  reduced  space 
in  which  the  separation  of  training  groups  was  maximum.  To  determine 
the  probability  that  a  person,  having  a  particular  set  of  scores,  was  a 
member  of  a  certain  training  group,  the  centour  classification  solution 
was  computed  for  that  person.  The  effectiveness  of  classification  was 
verified  from  a  stratified  random  sample  of  graduates  whose  scores  were 
set  aside  for  this  purpose  at  the  beginning  of  this  study.  It  was  also 
verified  from  the  criterion  sample  which  was  used  in  the  analysis.  The 
results  of  the  classification  were  tabulated  in  three  categories:  correct, 
close,  and  fairly  close  classifications.  Both  conditional  and  Bayesian 
rules  of  classification  were  applied  to  determine  the  training  group  to 
which  a  person  would  most  probably  belong.  In  evaluating  the  effective¬ 
ness  of  classification,  comparisons  were  made  in  terms  of  percentages 
of  predicted  correct,  close,  fairly  close,  and  overall  classifications. 
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Five  discriminant  analyses  and  classification  solutions  were 
carried  out  on  the  graduate  sample.  In  this  study,  analysis  2  involving 
8  training  groups  and  19  predictor  variables,  produced  the  best  results 
of  classification. 

The  effectiveness  of  analysis  2  was  further  verified  by  applying 
the  prediction  weights  of  this  analysis  to  a  sample  of  non-graduates.  The 
results  were  opposite  to  those  in  analysis  2.  This  further  indicated 
that  the  prediction  scheme  developed  in  this  study  could  be  a  useful 
tool  for  vocational  counseling  at  NAIT. 

Several  modifications  of  the  analysis  were  made,  such  as  reducing 
the  number  of  training  groups  to  minimize  large  overlaps  of  groups , 
limiting  the  number  of  predictor  variables  to  those  which  were  weighted 
high,  and  carrying  out  analyses  using  both  limitations.  Reducing  the 
number  of  training  groups  was  found  to  improve  the  number  of  correct 
classifications  but  it  did  not  always  improve  accuracy.  Limiting  the 
predictor  variables  to  those  which  were  highly  weighted  showed  lower 
correct  classifications  than  using  all  the  predictor  variables.  Analysis 
was  done  on  12  predictors  which  were  readily  available  to  vocational 
counselors  at  NAIT.  The  results  were  also  lower  than  using  19  variables 
but  would  probably  serve  the  purpose  of  vocational  counseling  although 
with  a  lesser  degree  of  effectiveness. 

The  question  whether  the  19  predictors  could  differentiate  between 
graduates  and  non-graduates  was  investigated.  The  results  showed  that  it 
was  not  possible  to  distinguish  between  these  two  categories  accurately 
enough  for  use  in  counseling. 
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Interpretations 

The  nineteen  predictor  variables  did  discriminate  8  training 
groups  fairly  well.  However,  it  was  possible  in  this  study  to  reduce 
the  number  of  predictors  by  using  those  with  high  discriminant  weights 
and  still  obtain  fair  results.  One  analysis  showed  that  prediction 
based  on  10  predictor  variables  were  only  slightly  less  accurate  than 
those  based  on  the  entire  19  variables.  This  indicated  that  some  var¬ 
iables  were  somewhat  duplicated  and  were  measuring  characteristics  similar 
to  the  other  variables.  From  the  discriminant  analysis  one  might  be 
able  to  give  a  parsimonous  interpretation  of  what  was  being  measured  by 
the  19  variables.  In  this  study,  the  first  discriminant  function  was 
interpreted  as  measuring  the  mechanical  domain,  while  the  second  function 
the  intellectual  domain,  and  the  third  function  the  verbal-enterprising 
domain.  Much  improvement  could  still  be  made  in  the  selection  of  pre¬ 
dictor  variables  over  those  used  in  this  study.  Predictor  variables 
which  measured  specific  aptitudes  of  training  fields  or  training  courses 
and  other  attributes  would  probably  further  increase  the  discrimination 
of  groups. 

The  large  overlaps  of  groups  1,  2,  3,  5,  and  9  in  the  first  analy¬ 
sis  indicated  that  misclassif ications  would  occur  in  these  groups.  This 
was  confirmed  by  the  results  of  later  analyses  and  classifications.  Con¬ 
sidering  the  general  aptitude  and  training  requirements  in  groups  1,  2, 

3,  4,  and  5  which  were  largely  mechanical,  it  would  appear  that  these 
groups  were  somewhat  similar.  Could  distinction  of  these  groups  be  further 
improved?  Further  investigation  on  this  question  was  needed  using  other 
predictors  that  could  measure  certain  specific  attributes  of  persons 
belonging  to  these  training  fields  and  perhaps  secure  a  much  clearer 
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differentiation  than  this  study  was  able  to  achieve.  As  stated  earlier, 
the  procedure  of  combining  groups  could  not  be  overdone  without  reducing 
the  usefulness  of  the  prediction  scheme  for  counseling. 

When  groups  1  and  2  were  combined  in  the  second  analysis ,  the 
overlapping  of  groups  was  reduced  slightly,  but  overlaps  still  occurred 
in  groups  3  and  5.  Again  the  results  showed  that  these  two  groups  were 
not  well  classified.  The  vocational  counselor  should  perhaps  be  cautious 
in  using  the  classifications  named  under  these  two  groups. 

The  decision  on  the  a_  priori  grouping  of  training  fields  and  the 
kinds  of  predictor  variables  to  be  used  were  important  factors  in  this 
prediction  scheme.  Each  group  should  be  distinct,  as  measured  by  the 
predictor  variables  in  order  to  secure  less  overlap  among  groups.  Also, 
it  was  important  that  the  sample  size  in  each  group,  both  those  used  in 
the  analysis  and  in  the  validation,  should  be  large  enough  to  insure 
normality  and  homogeneity  of  groups. 

In  the  conditional  rule  of  classification,  a  person  was  classified 
into  a  certain  group  when  the  centour  or  probability  he  obtained  for  that 
group  was  the  highest  among  the  groups.  The  vocational  counselor  should 
exercise  some  caution  in  interpreting  such  classifications.  Cooley 
and  Lohnes  (1962)  pointed  out  that  this  rule  assumed  that  membership  in 
one  group  occurred  as  often  as  that  of  other  groups.  When  membership 
in  one  group  occurred  more  often  than  the  second,  this  rule  tended  to 
classify  persons  to  the  second  group.  The  rule  also  assumed  that  the 
dispersions  in  each  group  were  equal.  When  the  dispersion  of  one  group 
was  greater  than  that  of  a  second  group,  this  rule  tended  to  classify 
persons  into  the  first  group.  However,  the  classification  results 
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computed  using  the  total  dispersion  and  using  the  separate  dispersion 
in  each  group  was  found  to  give  similar  results  in  this  study.  This 
indicated  that  dispersions  of  groups  in  this  study  were  somewhat  similar 

The  Bayesian  classification  rule  identified  the  group  to  which  a 
person  most  probably  belonged.  The  solution  under  this  rule  considered 
and  minimized  the  possibilities  of  inequalities  of  group  membership  and 
group  dispersion  thereby  minimizing  mis  classification.  Therefore,  it 
was  a  better  rule  to  apply  than  the  conditional  rule.  However,  as 
noted  in  this  study,  if  the  counselor  considers  the  classification  ob¬ 
tained  by  applying  both  rules  he  cannot  be  far  from  correct  in  naming 
the  training  group  to  which  a  student  might  probably  belong. 

Centours  and  probabilities  for  counseling 

Chart  1  is  a  sample  computer  print-out  of  the  classification 
taken  in  analysis  2  (8  training  groups,  19  predictors).  The  six-digit 
number  (240603)  at  the  top  of  the  left  column  represents  the  student 
ID  (24)  ,  the  specific  technology  for  which  the  student  applied  (06)  ,  and 
the  training  field  initially  chosen  (03).  The  student’s  discriminant 
vector  scores  in  the  three  functions  are  shown  on  the  first  line.  Chi 
square  values  of  this  discriminant  score  vector,  for  each  training  group 
are  shown  on  the  second  line.  Posteriori  probabilities  of  each  training 
group,  based  on  the  Bayesian  rule,  are  displayed  on  the  third  line  while 
the  fourth  line  shows  the  conditional  probabilities  obtained  from  the 
chi  square  values. 

The  highest  posteriori  probability  for  student  No.  24  is  0.22. 
Therefore,  by  the  Bayesian  classification  rule,  this  student  resembled 
graduates  of  training  field  1  (industrial-control  sys.).  His  highest 
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Chart  1.  Sample  Classification  print-out  of  the  computer 
(taken  from  analysis  2) 
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conditional  centour  (probability)  is  0.96.  By  the  conditional  rule  this 
student  resembled  graduates  of  training  field  2  (natural  resources). 

If  the  initial  choice  of  this  student  was  training  field  3  (engineering) 
then  his  group  3  posteriori  probability  of  0.18  was  fairly  close  to  the 
Bayesian  classification  of  0.23  (group  1)  and  his  group  3  probability 
of  .94  was  close  to  the  conditional  classification  of  0.96  (group  2). 

The  vocational  counselor  may  therefore  attempt  to  counsel  the 
student  whose  initial  choice  is  training  field  3  (engineering)  by 
explaining : 

1)  that  he  resembles  less  closely  students  who  satisfactorily  com¬ 
pleted  training  field  3  (engineering) ; 

2)  that  he  resembles  students  who  satisfactorily  completed  either  train¬ 
ing  field  1  (industrial  control  sys.)  or  training  field  2  (natural  re¬ 
sources)  ; 

3)  that  he  very  closely,  closely,  and  less  closely  resembles  students 
who  satisfactorily  completed  training  fields  1,  2,  and  3  respectively. 

In  counseling  student  No.  25,  whose  choice  is  engineering  (field  3), 
the  vocational  counselor  may  try  to  make  the  student  aware: 

1)  that  he  resembles  students  who  satisfactorily  completed  training 
field  3; 

2)  that  he  may  want  to  consider  either  training  field  2  or  training 
field  8  which,  compared  with  students  who  satisfactorily  completed  these 
training  fields,  he  resembles  closely  and  less  closely  respectively. 

Student  No.  26  resembled  students  who  satisfactorily  completed 
training  field  8  and  3,  while  student  No.  27  resembled  students  who 
satisfactorily  completed  training  field  2. 
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From  the  conditional  and  Bayesian  classifications  shown  in  the 
print-out,  the  vocational  counselor  may  counsel  incoming  students  to  con¬ 
sider  three  alternative  training  fields.  However,  counselors  should 
not  lose  sight  of  the  fact  that  the  final  decision  is  to  be  made  by  the 
student . 

Implementation  of  the  prediction  scheme 

It  was  shown  that  analysis  2  (graduates)  gave  fairly  good  class¬ 
ifications.  Also,  that  was  an  indication  that  the  group  centroids,  total 
dispersion,  and  transformation  matrices,  which  were  derived  in  this  anal¬ 
ysis  could  be  used  as  bases  for  computing  centours  (probabilities)  and 
classifying  incoming  students.  However,  the  input  data  for  each  student 
must  contain  scores  of  all  19  predictors. 

In  the  event  that  the  scores  of  the  Alberta  Grade  9  departmental 
examinations  and  SCAT  Level  3  which  constituted  the  7  of  the  19  predictor 
variables  in  this  study  would  not  be  available  and  since  the  scores  of 
the  12  predictors  were  easily  accessible  to  vocational  counselors  at 
NAIT,  the  matrices  in  analysis  5  could  be  used  instead  of  those  mentioned 
above.  As  in  the  previous  case,  input  data  for  each  student  must  con¬ 
tain  scores  for  each  of  the  12  predictors. 

The  sanea  priori  probabilities  may  be  used  in  computing  the 
classification  or  perhaps  new  a.  priori  probabilities  may  be  established 
based  on  a  breakdown  of  the  proposed  enrollment  of  new  students  for  the 
school  term. 

Limitations  of  the  study 

There  were  several  drawbacks  in  using  data  designed  for  purposes 
other  than  this  study.  Particularly  outstanding  was  the  requirement  of 
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a  complete  set  of  observations  on  each  student  for  all  predictor 
variables.  Thus,  several  students  had  to  be  dropped  from  the  study  due 
to  an  incomplete  set  of  scores.  However,  the  sample  was  large  enough  to 
enable  the  investigator  to  secure  a  sufficient  number  of  cases  to 
work  with. 

Another  weakness  of  this  study  was  that  group  sample  sizes  were 
quite  small.  The  investigator  combined  several  training  courses  in 
order  to  secure  adequate  group  sizes.  Also,  the  sizes  of  the  validation 
samples  were  too  small  to  provide  a  rigorous  validation  of  the  results. 
However,  in  spite  of  this  weakness,  the  study  showed  that  the  predictors 
could  discriminate  fairly  well  between  different  training  groups. 

The  choice  of  predictor  variables  was  also  another  weakness  of 
this  study.  Some  variables,  as  shown  in  the  analyses,  were  somewhat 
duplicated  by  other  variables  and  thus  contributed  little  to  prediction. 
Ten  predictors  out  of  19  appeared  to  contribute  greatly  to  the  discrimin¬ 
ation  of  different  groups. 

However,  this  would  not  imply  that  only  a  few  predictors  were 
needed  in  this  type  of  prediction  scheme.  Instead,  from  the  interpre¬ 
tation  on  how  the  scores  of  the  sample  of  graduates  were  distributed  in 
the  reduced  discriminant  space,  it  indicated  that  the  use  of  a  large 
number  of  predictors  would  improve  differentiation  of  groups.  What 
was  important  was  the  selection  of  appropriate  predictors  to  measure 
the  various  psychological  domains  of  a  given  population  under  study. 
Furthermore,  this  study  showed  that  19  predictors  differentiated  and 
classified  better  than  10  or  12  predictors. 
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Implications  for  research 

This  study  could  be  a  start  of  a  series  of  studies  along  this 
line  at  NAIT,  as  well  as  in  other  technological  institutes.  Validation 
studies  may  be  conducted  using  new  samples  to  verify  the  usefulness  of 
this  prediction  scheme. 

A  2-year  experiment  might  be  designed  by  taking  two  groups  of 
new  students.  One  group  could  consist  of  students  who  choose  their 
technologies  by  the  usual  procedures  prescribed  at  NAIT  while  the  other 
group  could  consist  of  students  who  choose  their  technologies  through 
the  assistance  of  a  vocational  counselor  who  uses  classification  based 
on  the  prediction  scheme  of  this  study.  Two  years  later,  comparisons 
of  outcomes  could  be  made  to  find  out  which  procedure  of  choosing  tech¬ 
nologies  would  result  in  a  large  proportion  of  graduates. 

A  validation  study  could  be  conducted  on  a  new  sample  of  graduates 
using  the  data  derived  in  this  study  and  verifying  the  results  of  this 
study.  Other  predictors,  such  as  the  Strong  Vocational  Interest 
Blank,  the  Kuder  Preference  Record  could  perhaps  be  used  with  the  highly 
weighted  variables  found  in  this  study  and  extending  the  criteria  not 
only  to  training  outcomes  but  also  job  success  after  leaving  school. 

Also,  research  might  be  carried  out  to  develop  or  locate  measures  which 
could  distinctly  discriminate  between  different  training  fields.  The 
development  of  procedures  for  determining  job  families  still  needs 
further  investigation. 

Conclusions 

1.  The  null  hypothesis  that  there  were  no  significant  differences 
among  group  means  in  the  sub-scores  of  the  Differential  Aptitude  Tests, 


’ 
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the  Lorge-Thorndike  Intelligence  Tests,  the  Grade  Nine  Alberta  Depart¬ 
mental  Examinations,  SCAT  Level  3,  and  high  school  averages  of  groups  of 
NAIT  students,  classified  on  an  a  priori  basis,  was  rejected.  There 
were  differences  in  group  means  among  the  training  groups.  This  in¬ 
dicated  that  the  variables  in  this  study  were  useful  in  differentiating 
various  groups. 

2.  The  generalized  multivariate  null  hypothesis  that  character¬ 
istics  of  various  training  groups,  as  measured  by  the  nineteen  predictors, 
were  the  same,  was  not  tenable.  Group  centroids  were  found  to  be  sig¬ 
nificantly  different.  This  further  indicated  that  the  variables  could 

be  effective  predictors  for  identifying  membership  in  certain  training 
groups  of  students  enrolled  at  NAIT. 

3.  Three  roots  were  found  significant  and  were  used  in  deter¬ 
mining  the  dimensionality  of  the  discriminant  space  of  the  graduate  sample. 
It  was,  therefore,  possible  to  describe  the  separation  of  groups  along 

the  discriminant  functions  in  these  reduced  spaces. 

4.  Results  of  the  classification  solutions  showed  that  ident¬ 
ification  of  membership  of  persons  to  certain  training  groups  gave  a 
fairly  good  percentage  of  correct  predictions  well  beyond  the  probability 
of  1  in  8  or  .125. 

5.  Psychological  meanings  were  derived  from  the  distribution  of 
scores  of  the  NAIT  graduate  criterion  sample  in  a  three-dimensional 
discriminant  space.  The  first  discriminant  function  was  interpreted  as 
measuring  mechanical  characteristics  of  the  students,  the  second  function 
as  measuring  intellectual  characteristics,  and  the  third  as  measuring 
verbal-enterprising  characteristics . 

6.  The  discriminant  analysis  and  classification  involving  2 
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outcome  categories  and  19  predictors  showed  that  it  was  not  possible  to 
differentiate  with  sufficient  accuracy  between  graduates  and  non-graduates. 
However,  the  results  of  the  analysis  involving  8  training  groups  and  19 
predictors  would  likely  be  helpful  in  counseling  the  student’s  choice  of 
training  program  to  pursue  and  could  supplement  the  selection  procedures 
of  the  institute. 

In  conclusion,  the  main  objective  was  to  develop  a  procedure  for 
predicting  training  outcomes  of  graduates  at  NAIT,  based  on  19  predictor 
variables.  This  study  demonstrated  that  the  19  predictors  may  be  analyzed 
to  provide  a  workable  prediction  scheme  for  purposes  of  vocational 
counseling.  Thus,  it  is  possible  to  develop  prediction  schemes  for  use 
in  technological  institutes  as  well  as  in  high  schools. 
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APPENDIX  I 


Mathematical  concept  of  multiple  discriminant  analysis 


The  general  description  of  the  multiple  discriminant  analysis 
as  described  by  Tiedeman,  Byran,  and  Rulon  (1953)  is  quoted  in  full 


below: 


Let  X  .  denote  the  value  of  the  i 
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G)  where  G  is  less  than  n.  Without  loss  of  generality  we  may  define 


the  X  .  so  that 
PgJ 


Z  Z  X  .  =  0 
g  P  PgJ 


(2.1) 


for  all  values  of  j 


X 


In  general,  separate  group  means  such  as 
1 


gJ 


N 


g 


2  X  . 

P  PgJ 


(2.2) 


do  not  vanish. 


A  linear  function  of  the  X.  such  as 
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where  y  is  the  value  of  the  linear  function  for  the  p —  individual  in 
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the  g —  group.  Because  of  the  restrictions  defined  by  equation  (2.1), 


the  among  means  of  groups  sum  of  squares  of  y  is 
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The  within  groups  sum  of  squares  is 


aria  :o'i  to  aonul  nsrrJrl  a ri  t  \o  9ult  v  aria  a±  v  siadw 


143 


Z  Z  (y  -  y)2 
g  P  Pg 


(2.6) 

3 

The  coeficients,  v..,  v_,  ...,  v  of  y  are  defined  by  Fisher’s 

12  n 

criterion  of  discriminant  analysis,  i.e. ,  by  maximization  of  the  ratio 


A  = 


Z  N  y 

&  .  g.  g 


Z  Z(y  -  y)2 
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(2.7) 


The  function  A,  has  several  extrema  each  of  which  is  indicative 

of  a  distinct  dimension  of  the  subspace  defined  by  the  group  means.  All 

4 

discriminant  functions  are  obtained  from  the  same  initial  ratio  A. 


Maximization  of  A  is  accomplished  by  a  combination  of  two  effects, 
increase  in  the  among  means  of  groups  sum  of  squares,  and  decrease  in 
the  within  groups  sum  of  squares.  Therefore,  the  linear  functions  de¬ 
fined  by  the  discriminant  analysis  criterion  provide  as  much  separation 
of  group  centroids  as  is  possible  within  the  restriction  of  having  the 
groups  as  homogeneous  as  possible.  The  utility  for  group  classification 
of  functions  with  this  property  is  obvious. 

Manipulation  of  the  quadratic  forms  of  the  variates  required  for 
discriminant  analysis  is  facilitated  by  use  of  matrix  algebra.  There¬ 


for  let  us  define  the  symmetrical  matrices 
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a.  . 
ij 


,  a..  =  ZN  X  .X  .  =  a..,  (i,  j  =  1,  2,  ...,  n) ; 
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(2.9) 


and  the  column  vector 


v  =  [v  ]  ,  (j  =  1,  2,  .  . .  ,  n) . 


(2.10) 


In  this  notation,  equations  (2.5),  (2.6),  and  (2.7)  may  be  written  as 
_  2 

EN  y  =  v'Av  (2.5) 

g  g 


and  A  = 


ZE(y  -y)2=v'Wv 
g  P  Pg  g 

v'Av 


v  'Wv 


(2.6) 

(2.7) 


It  has  been  shown  that  the  matrix  equation  defining  the  vector 


v  is 


(v'Wv)  Av  -  (v'Av)  Wv  =  0 
This  may  be  rewritten  as 
(A  -  AW)  v  =  0 
Defining 

R  ^  W_1  A, 
this  becomes 

(R  -  AI) v  =  0  (2.11) 

where  I  is  the  unit  matrix. 

The  coefficient  of  the  discriminant  functions  are  determined  by 
the  latent  vectors  of  R,  and  the  corresponding  latent  roots  of  R  equal 
the  respective  ratios  of  among-groups  to  within-groups  sums  of  squares. 

By  considering  the  rank  of  the  matrix  A,  it  is  a  simple  matter  to  show 
that  the  number  of  solutions  of  (2.11),  such  that  A  /  0,  is  at  most 


See  for  instance:  J.  G.  Bryan,  ibid. ,  pp.  131-132. 


vem  (*.£)  bnB  ,  (d.S)  «(i^>  enoiDBups  .noiJfion  slriJ  oJ 


§ 


■ 


«Ci  .S)  1>j  anoint .  y»  la  itxinum  Jb 
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equal  to  the  smaller  of  the  two  integers  G  -  1,  and  n.  Consequently, 
letting  r  stand  for  the  smaller  number,  the  total  discriminative  power 
of  the  variates  is  exhausted  by  r  linear  functions  defined  in  the  manner 
stated.  Among  these,  all  functions  corresponding  to  distinct  values 
of  A  are  uncorrelated  as  they  stand.  Repeated  roots  other  than  zero  are 
possible  but  unlikely  to  occur.  If  one  or  more  multiple  roots  should 
occur,  however,  the  vectors  corresponding  to  any  one  of  them  are  already 
uncorrelated  with  the  vectors  corresponding  to  all  different  roots  and 
can  be  chosen  in  such  a  way  as  to  be  uncorrelated  among  themselves.  The 
numerical  values  of  these  functions  are  independent  of  the  origin  of 
coordinates,  the  units  of  measurement,  and  in  fact  independent  of  any 
non-singular  linear  transformation  of  the  variates. ^ 


J.  G.  Bryan,  ibid.  , 


pp.  138-139. 
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LEVEL  IS  (  1  JULY  SSI 


OS/360  FORTRAN  H 


OATF  69.172/05.04.20 


COMF1 LFH  OPTIONS 


mulvid 


PURPOSE: 


M  A  IN,DPT  =  02,L  I  NECNT  =  59,SOl)RCE,EBCDIC,  NOLI  ST,  NODFCK.L  DAO,  NOMAP,  NOEDIT, 
VISION  OF  E  OlIC  AT  I  ON  Al  RFSFARCH  SERVICES 


NA«F  = 

DIVtSIUN  OF  EDUCATIONAL  RESEARCH  SERVICES 
UNIVERSITY  OF  ALBERTA 


ID*  NOXREF 


CARRIFS  OUT  DISCRIMINANT  ANALYSIS  AND  ONE  WAY 
MULTIVARIATE  ANALYSIS  OF  VARIANCE,  AND  ALSO  GIVES  CARD 
OUTPUT  FOR  CLASSIFYING  SUBJECTS  BY  USING  MULV03 

CARD  INPUT: 

1. TITLEI20A4I 

2. PARAMFTFRS!  16I5):NG»NV,NT  *NO,(NCELL( II *I  =  l*NG) 

3.  FORMAT  FOR  OATA  CARDS  I 20A4 I 

4.  DATA  CARDS 

5.  A  RLANK  CARD 
DESCRIPTION  OF  PARAMETERS: 

NO:  NO  OF  GROUPS 

NV:  NO  OF  VARIABLES  PER  OBSERVATION 
NT:  NO  OF  TOTAL  OBSERVATIONS 

NO:  TYPE  1  IF  CARD  OUT  PUT  FOR  MIJLVU  REQUIRED. 

NCFLLII):  NO  OF  OBSERVATIONS  IN  I  TH  GROUP 

remark  S: 

1. NV  and  ng  ape  limited  UP  TO  50 

2.  EACH  GROUP  SHOULD  HAVE  AT  LEAST  NV  OBSERVATIONS  I.E., 

NC  ELL  I  I  I  SHOULD  NOT  BE  LESS  THAN  NV 

PROGRAMMER  k.bay 

SUBPROGRAMS  F I SHER , HOMOO I S , MXOUT , DA T A , R AOFT , ROYTE S , DATR AN , S I GD I S 

SSP  CORR E, GM ADD, GMPRDtGMSUB.MCPY, MI NV,C SUM, LOC.NROOT, EIGEN 


I  SN 

000? 

DIMFNSION  FMT( 201 ,TI TLEI 20) , XI 501 ,NCELL( 50) , XBAR I 50 1 , STDI 50  I , 
1RR(50I,D0(50I,TT(50),LL(50),MM(50I ,COR( 1300) 

I  SN 

0003 

DIMENSION  R( 50, 50), W(2500I,H(2500I,E(50, 501, 0(2500), RXI2500I 

I  SN 

0004 

100 

FORMAT ( 20A4 I 

I  SN 

0005 

101 

FORMAT! 1 H 1 , 10X.20A4) 

r  sn 

0006 

102 

FORMAT!  1615  I 

I  SN 

0007 

103 

FORMAT! IX, 'NO.  OF  GROUPS  = ' , 22X , I 5, / , IHO , • NO  OF  VARIABLES  PFR  ORSE 
1 RVAT ION  =• ,4X, 15,/, IHO, 'TOTAL  NO  OF  OBSERVATIONS  =',10X,I5> 

I  SN 

000« 

104 

FORMAT! 1H0, 'FORMAT  FOR  DATA  :',20A4) 

T  SN 

0009 

105 

FORM AT( /,/, IHO, 'GROUP: ', 13, 4X, ' NO  OF  OBSERVATION  IN  THIS  GROUP', 15 
1 1 

FORMAT! IH1, 'BARTLtTTS  HOMOGENI TY  OF  DISPERSION  TEST  SEE  COOLEY  P 

ISN 

0010 

106 

162'  ) 

ISN 

001  1 

107 

FORMAT! 1H1, 'WILKS  L  A  MU A  TEST  SEE  COOLEY  P  61*1 

ISN 

001  ? 

10B 

FORMAT! IX, ' DETERMINANT  OF  W=',F16.8) 

ISN 

0013 

1  00 

FORMAT! I X, • OF  1  = ' ,F5.0,3X, 'DF2=' ,F9.0,3X, 'F-RATIO  =',E14.6,3X, 

1 'PRO PAG IL  1 T  Y  — ' ,F9.6, 3X,  ' L A  MDA= ' ,F9.6) 

I  SN 

C  014 

110 

FORMAT! 1H1, 'MAXMUM  LATENT  ROOT  APPROACH  SEE  MORRISON  P168') 

ISN 

0015 

ill 

FORMAT! IHO, 'S  =‘,I5,3X,'M  = ' ,F9 . 1 , 3X , • N  =• , F9. 1 ,3X , • HECK  =',F9.6) 

ISN 

0016 

112 

FORMAT (IHO, '  CONSULT  HECKS  CHART  ON  MORRISON  P  312' 1 

I  SN 

0017 

113 

FORMAT! 5F 1 b.9 1 

ISM 

00l« 

114 

FORMAT! 1H1, 'DISCRIMINANT  ANALYSIS') 

T  SN 

0019 

M  A  X  =  50 

ISN 

0020 

MAXM-MAX*MAX 

I  SN 

0021 

1 

RFADI5, 100,FND=9999)  TITLF 

ISN 

00?? 

IF  ( TITLE! 1 ) .EQ.T ITLE ( 2) )  GO  TO  9999 

I  SN 

00?4 

WRITE! 6, 101)  TITLE 

I  SN 

00?  5 

READ! 5, 102)  NG,NV,NT,ND, (NCELL! 1 ) , I=1,NG) 

Ir.N 

0026 

WRITE! 6, 103)  N  G , N  V , N  T 

I  SN 

0027 

READ  ( 5 , 1 00 ) FM T 

I  SN 

002ft 

WR  I  TL‘ (6, 104)  FMT 

I  SN 

0029 

00  2  1  =  l , MA  X  M 

I  SN 

00  3  0 

2 

M I  1=0.0 

I  SN 

0031 

Hll  OGS  -O.O 

T  SN 

0032 

REWIND  2 

T  SN 

0033 

DO  3  J=1,NT 

I  SN 

0034 

RFAO! 5, FMT 1  ( X ( I ) , I = 1 , NV ) 

ISN 

0035 

CALL  QATRAN  (NV,X> 

ISN 

0036 

3 

WRITE!?)  1 X! I 1 , 1=1 ,NVI 

I  SN 

0037 

REWIND  2 

ISN 

0038 

NVV  =  NV»NV 

I  SN 

003° 

DO  5  NNG= 1 , NG 

I  SN 

0040 

NN -NCEL  1  ( NMG ) 

I  SN 

004  1 

WHITE(6,105)  MNG.NN 

I  SN 

004? 

CAL!  CORRF  ( NN , N V , 0 , 0 . 0 , X B AR , ST D , R X , COR , B B , DD , T T ) 

ISN 

0043 

IF(ND.EQ.l)  WRITEI7.113)  ( XBAR ( I ) , I = 1 , NV ) 

I  SN 

0045 

CALL  MXOUT! XBAR , 1 ,NV, 0,60 , 132, 1 , 24, 24H  MEANS  FOR  THIS  GROUP  ) 

I  SN 

0046 

CALL  MXOUT! STD, 1 ,NV, 0,60, 132, 1 , 36, 36H  STANDARD  DEVIATIONS  FOR  THIS 

1  GROUP  ) 

I  SN 

0047 

CAL!  GMADD(W,RX,W,NV,NV) 

I  SN 

004n 

DO  4  1=1, NVV 

I  SN 

0049 

4 

RX ( I 1 =R  X 1 I )  '  ( NN- 1.01 

I  SN 

0050 

CA!  L  MXOUT (RX.NV.NV, 0,60, 132, 1 ,20, 20H  COVARIANCE  MATRIX  ) 

1  SN 

005  1 

IF(ND.NE.l)  GO  TO  15 

l  SN 

005  3 

CA!  L  APPAY! 1 ,NV,NV, 50,50, RX,F) 

I  SN 

0054 

DO  14  I = 1 , NV 

ISN 

0055 

1  4 

WRITF (7, 1  13)  <E(  I, J ) , J  =  1,NV) 

I  SN 

0056 

1  5 

CALL  MINV(RX,NV,DET,LL,MM| 

I  SN 

0057 

OFTEPM=ALOG  (OET) 

I  SN 

005ft 

HI LOG= (  NN-l.O)*DETFRM 

I  SN 

0059 

H110GS=H1L0GS+H1L0G 

I  SN 

0060 

5 

CONTINUE 

l  SN 

006  l 

Pf.WINn  2 

ISN 

0062 

CALL  MCPYIW,F,NV,NV,OI 

ISN 

006  3 

CA1L  MINV! E ,NV,DETG,LL,MM) 

ISN 

0064 

CALL  CORRE! NT, NV, 0,0.0, XBAR, ST D, RX ,COR , BH, DD, TT ) 

I  SN 

0065 

WRI TF (6, 101 ) 

ISN 

006  9 

CALL  GMSUB(RX,W,H,NV,NV) 

I  SN 

0067 

CALL  MXOUT! XBAR ,  1  ,NV ,0, 60 , 132, l ,24 , 24H  MEANS  OF  TOTAL  SAMPl E  ) 

ISN 

006M 

CALL  MXOUT! STD, 1 ,NV, 0, 60, 132, 1 ,40,40H  STANDARD  DEVIATIONS  FOR  TOTA 
1L  c  A  MP)  F  ) 

ISN 

0069 

CALL  MXOUT(C!JR,NV,NV, 1,60, 132,1, 36, 36H  CORRELATION  MATRIX  FOR  TOTA 
1L  SAMPLE) 

I  SN 

007  0 

DO  6  1=1, NVV 

l  SN 

0071 

6 

D (  I  )  =  W (  I  l/INT-NG) 

ISM 

0072 

C  Al  L  MI  NV!  0 ,  NV ,  ()ET  ,  LL  ,  MM  ) 

I  SN 

0073 

DETER M=ALOG(DET) 

I  SN 

0074 

WRITF 16, 1061 

I  SN 

0076 

WRITE (6,108)  DET 

-  , 

' 

■ 

.. 


. 
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I  SN 

0076 

H  1  LOG  =  (  NT-  NG )  *OET  EP.M 

I  SN 

0077 

XMM=H1L0G— H1L0GS 

1  SN 

0078 

CALI  MINV(RX,NV,DFT,LL,MM) 

I  SN 

0079 

OFTEPT=ALOG(DETI 

I  SN 

0080 

WRITEI6, 107) 

I  SN 

0081 

NDF=NG- 1 

I  SN 

008  2 

CAl  L  RAOFT(H,W,R,LL,MM,NV,NDF,i 

1  'N 

0083 

WP I TE 1 6 , 109 )  F  1  ,F2,F ,PRO,XLAMR 

ISM 

008  4 

PFWIND  2 

I  SN 

0085 

WRITE(6,110) 

1  SN 

0086 

CALL  MXOUT(W,NV,NV, 0,60, 132, 1, 

I  SN 

0087 

CALL  MXOUT(H,NV,NV, 0,60, 132,1, 

ISN 

0088 

CALL  GMPRO(H,E,R,NV,NV,NV) 

ISN 

0  08  9 

CALL  MXOUT(R,NV,NV, 0,60, 132,1, 

ISN 

0090 

CALL  MCPYIW,E,NV,NV,0) 

I  SN 

009  1 

CALL  MCPY(W,D,NV,NV.O) 

I  SN 

009? 

CALL  ROYTFSIW,H,R,X,NDF,NV,NG, 

I  SN 

0093 

WRITE(6,lll)  NS , FM, FN, ROOT 

I  SN 

0094 

WRITE (6,112) 

ISN 

0095 

IF  (ND.NE.l)  GO  TO  8 

I  SN 

0097 

CALL  ARRAY! l.NV.NV, 50,50, E, El 

I  SN 

0098 

PL)  7  I = 1 , NV 

I  SN 

0099 

DO  1  1  J  =  1 ,NV 

I  SN 

0100 

1  1 

C(  I , J)=E( I , J  )  / 1 NT-NG ) 

ISN 

0101 

7 

WRITEI7.113)  IE!  I,J).J=l,NV) 

I  SN 

0102 

8 

WP.ITE(6,U4) 

I  SN 

0103 

CALL  MXOUT  (R.NV.NDF,  0,60, 132,  1 
1 1 F  I CAT I  ON  ) 

ISN 

0104 

DO  12  1=1, NVV 

ISN 

0105 

12 

01 I 1=0! I ) / I NT-NG ) 

I  SN 

0106 

CALL  MXOUT! D ,NV,NV ,0,60, 132, 1, 

I  SN 

0107 

IF  (ND.NF.l!  GO  TO  10 

I  SN 

0109 

CALL  ARRAY! l.NV.NDF, 50,50, R,R) 

I  SN 

0110 

DO  9  1=1, NV 

I  SN 

0111 

9 

WRITE(7,113I  (R( I , J) , J=1,N0F) 

1  SN 

on? 

10 

CALL  SI GDS! X,NV ,NDF,NG,NT) 

I  SM 

0113 

GO  TO  1 

I  SN 

0  114 

9999 

STOP 

I  SN 

oils 

END 

******  EMI)  nr  COMP  II  AT  I  DM  ****** 

LEVEL  16  (  l  JULY  68)  OS/360  FORTRAN  H  OATF  69.172/05.04.34 


I  PN 


t  SM 
I  SN 
ISN 
ISN 
ISN 
ISN 
ISN 
ISN 
ISN 
ISN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
ISN 
I  SN 
I  SN 
T  SN 
I  SN 
I  SN 
ISN 
I  SN 
ISN 
I'N 
ISN 
ISN 
I  SN 
ISN 
ISN 
I  SN 
I  SN 
ISN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 


COMPILER  OPTIONS  -  NAN  E  =  MA I N , OP T  =  02 , L I NECNT=59 , SOURCE , EBCD I C , NOL I  ST , NODECK , LUAD, NOMAP , NOED IT ,  10, NOXREF 


0002 

FUNCTION  F I SHFR ( FM , FN , X ) 

C 

COMPUTES  PROBABILITY  OF  F-STATISTIC  X  WITH  M  AND  N 

DEGREES  OF 

C 

FREEDOM  im  NUMERATOR  AND  DENOMINATOR  RESPECTIVELY. 

C 

TRANSLATED  FROM  ALGOL  TO  360  FORTRAN  IV  BY  FLATHMAN 

,  APRIL,  1968. 

C 

ALGORITHM  322,  COMMUNICATIONS  OF  THE  ACM  11,2,116, 

FEBRUARY, 1968. 

0003 

M=FM*0.5 

0004 

N=FN*0.5 

0005 

FISHEP-l.O 

0006 

IF! X . L E.O..OR.M.LE.O.OR.N.LE.O)  RETURN 

0008 

INTEGER  A , B 

0009 

A  =  2* 1  M/2 ) — M+2 

0010 

R=2* (N/2 )-N*2 

001  l 

W=X*M/N 

0012 

2=1 . /( 1 • *W ) 

0013 

IF! A.NF.l IGO  TO  2 

0015 

IFIB.NE. 1 ) GO  TO  1 

0017 

P=SORT(W) 

0018 

Y=0. 3183099 

0019 

D=Y*Z /P 

0020 

P=2.*Y*ATAN(P) 

002  l 

GO  TO  4 

0022 

1 

P=SQRT( W*Z ) 

0023 

D=0.5*P*Z/W 

0024 

GO  TO  4 

002  5 

2 

IFIB.NE. 1IG0  TO  3 

0027 

P=SORT( Z) 

0028 

D=0 . 5*7*P 

0029 

P=l.-P 

0030 

GO  TO  4 

0031 

3 

n=z*z 

003? 

P  =  W*Z 

0033 

4 

Y=2.*W/Z 

0034 

J  =  B*2 

0035 

IFU.GT.NIGO  TO  6 

00  3  7 

5 

G=(  l.*A/( J-2. ) ) *D*Z 

0038 

IMA. EQ. 1)  P  =  P*D*Y/(  J-l  ) 

0040 

IF(A.NF.l)  P=(P*W)*Z 

0042 

J  =  J*2 

0043 

IF(J.LE.N)  GO  TO  5 

0045 

6 

Y  =  W*Z 

0046 

Z=2./Z 

0047 

P  =  N-2 

0048 

I  =  A*  2 

0049 

IF!  I.GT.MI  GO  TO  8 

0051 

7 

J=  I  *8 

005? 

D=Y*D*J/< 1-2 1 

0053 

P=P-Z*D/J 

0054 

1=1*2 

0055 

IM  I.LE.M)  GO  TO  7 

0057 

8 

F!SHER=1.-P 

0058 

RETURN 

0059 

END 

******  fmI)  nF  COMPILATION  ****** 

LEVFL  16  (  1  JULY  68)  0S/360  FORTRAN  H  DATE  69.172/05.04.40 


COMPILER  OPTIONS  -  NAME  =  M A  I N , 0PT=02 , L I N ECNT=59 , SOURC E, E BCO I C , NOL I  ST , N3DECK , L OAD , NOMAP , NOEDI T ,  ID , NOXR EF 


ISN  0002 

c 

SUBROUTINE  HOMO  I S ( XMM , NCELL , NV, NG, NT ) 

USING  BARTLETTS  METHOD ,  TEST  HOMOGENITY  OF  DISPERSION  MATRICES 

c 

XMM 

INPUT  ( 1 NT-NG ) * AL OG  DET  POOLED  WI-SUM  OF  t ( NC ELL ! I  ) -  1 ) * 

c 

ALOG  DET  W( I ) 

c 

NCELL 

NO  OF  OBSERVATION  IN  EACH  SAMPLE 

c 

NV 

SIZE  OF  DISPERSION  MATRICES 

c. 

NG 

LENGTH  OF  NCELL 

. 
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C 

NT  TOTAL  NO  OF  OBSERVATIONS. 

I  SN 

000* 

DIMENSION  NCELL1 1 1 

I  SN 

0004 

IDO 

FORMAT! IX, 'OFl*' ,F9. 1,2X, *DF2- • , F9 . 1 , 2X , • F-R AT  1 0=  • ,E16.8,2X, 

1*  PRORARIL ITY=* , F9.6) 

I  SN 

0005 

1 1  1 

FORMAT! IX, 'FOR  TEST  OF  HI  M*  ',E16.81 

I  SN 

0006 

113 

FORMAT! IX, • A2-A1  SQUARED  *',E16.8> 

I  SN 

0007 

116 

FORMAT! IX, 'A1  =•  .F14.7I 

1  SN 

OOOR 

117 

FORM  AT  I  IX, 'A2  «• ,F14.7> 

I  SN 

0009 

1  IB 

FORMAT ! IX, 1 B= ',E16.81 

I  SN 

0010 

F«1S=0.0 

I  SN 

0011 

GA1S=0.0 

I  SN 

001? 

DO  1  I = l , NG 

I  SN 

001* 

FA IS-FA1S+ 1.0/ INCELL!  I  1-1.01 

1  SN 

0014 

1 

GAlS*GAlS+l.O/( (NCELLI I 1-1.01**21 

I  SN 

0015 

F 1= . 5* (  NG-l.Ol*  NV* (  NV+l.O) 

ISN 

0016 

AlA=(FAlS-( l.O/INT-  NGI 1 1* ( (2.0*(  NV**2. 011+13. 0*  NV1-1.0I 

I  SN 

0017 

A1=A1A/(6.0*I  NG-l.Ol*!  NV+1.01) 

I  SN 

0018 

A?*(GAlS-( 1 .0/1  NT-  NG>**2) )*( (  NV-1.0)*!  NV  +  2. 0 )) / ( 6 . 0* 1  NG-l.Ol) 

I  SN 

0019 

Dl F  =  A2- ( A1**?.0) 

ISN 

0020 

WRITE(6,lll)  XMM 

I  SN 

0021 

WRITE(6,113I  DIF 

I  SN 

0022 

IF  (DIF  1  8,8,9 

ISN 

0023 

8 

F7=IFl+2.0)/( (Al**2.0>— A2) 

I  SN 

0024 

B  =  F  2/ ( l • 0-A 1  +  ( 2 . 0/F2 1  I 

I  SN 

0025 

F=(F?**XMM)/!Fl*!B-XMM) 1 

I  SN 

0026 

GO  +0  10 

ISN 

0027 

9 

F2=(Fl+2.0)/DIF 

I  SN 

0028 

B  =  Fl/( 1.0— A1  —  (FI/F2J 1 

I  SN 

0029 

F  =XMM/B 

C 

F  IS  FRATIO  FOR  TEST  OF  HQMQGENITY  OF  DISPERSION  WITH  F1,F2  OF 

I  SN 

0030 

10 

PR0=FISHER(F1,F2,F) 

ISN 

0031 

WR I T  F 1 6 , 1 00 1  Fl,F2,F,PR0 

I  SN 

0032 

WRI TF (6 , 1 16 1  A l 

ISN 

0033 

WRITE (6, 1 17)  A2 

ISN 

0034 

WRITE(6,113)  B 

ISN 

0035 

RETURN 

I  SN 

0036 

END 

****** 

ENO  OF 

COMPILATION  ****♦♦ 

LEVEL 

14  (  1 

JULY 

68  ) 

OS/360  FORTRAN  H 

DATE  69. 172/05. 04.46 


COMPILER  OPTIONS  -  NAME  =  MA I N , OP T  =  02 , L I NECNT  =  59 , SOURC E , E BCD  I C , NOL I  ST , NODECK , L OAD , NOMAP , NOED I T , I D, NOXREF 


I  SN 

0002 

r 

SUBROUTINE  MXOUT ( A , N , M , MS. L l NS , IPOS, I  SR , NUMHOL , T I TLE 1 

L 

c 

A  : NAME  OF  OUTPUT  MATRIX 

r. 

N  : NUMB  ER  OF  ROWS  IN  A 

c 

M  '.NUMBER  UF  COLUMNS  IN  A 

c 

MS  : STnRAGE  MODE  OF  A 

c 

0  GENERAL 

c 

l  SYMETRIC 

c 

2  DIAGONAL 

c 

LINS  : NUMBER  OF  PRINT  LINES  ON  THE  PAGE (USUALLY  601 

c 

IPOS  : NUMB  EM  OF  PRINT  POSITIONS  ACROSS  THE  PAGE (USUALLY  132) 

c 

ISP  : L I NF  SPACING  CODE,  1  FDR  SINGLE  SPACE  2  FOR  DOUBLE 

c 

SPACE 

I  SN 

0003 

DIMENSION  A! 1 ) , Bt 8) ,T ITLEI20) 

ISN 

0004 

1 

FORMAT  (IH0,/,IH0,20A4) 

ISN 

0005 

2 

FORMAT! 12X , 8HC0LUMN  ,7(3X, 13,10X1) 

I  SN 

0006 

3 

FORMAT! 1H  ) 

ISN 

0007 

4 

FORMAT ( IH  , 7  X ,4HR0W  , I 3 , 7 ( F 16. 6 ) ) 

I  SN 

0008 

5 

FORMAT! 1H0, 7X.4HR0W  ,I3,7!E16.6)) 

ISN 

0009 

NN=( NUMHOL+3 1 /4 

ISN 

0010 

WRITE16, 11  (TITLE! J) ,J  =  1,NN> 

ISN 

0011 

J  =  1 

I  s  N 

001? 

NFND=IP0S/16-1 

I  SN 

0013 

LEND=IL INS/ I SPI-2 

I  SN 

0014 

10 

L  STRT= 1 

I  SN 

0015 

20 

CONTINUE 

I  SN 

0016 

JNT= J+NEND— 1 

ISN 

0017 

31 

IF! JNT-M)33,33,32 

I  SN 

0018 

32 

JNT  =  M 

I  SN 

0019 

33 

CONTINUE 

ISN 

0020 

WR  I  T  F (6 , 2  )  ( JCUR ,JCUR=J, JNT) 

I  SN 

0021 

IF(ISP-l)  35,35,40 

I  SN 

002? 

35 

WRITEI6.3) 

I  SN 

0023 

40 

LTEND=LSTP,T  +  LFND-1 

I  SN 

0024 

DO  80  L*LSTRT,LTEND 

ISN 

0025 

DO  55  K  =  l , N END 

I  SN 

0026 

KK  =  K 

I  SN 

0027 

JT  =  J+K- 1 

ISN 

0028 

CALL  LOCIL.JT, I JNT ,N»M,MS) 

ISN 

0029 

B( K ) *0.0 

ISN 

0030 

IF (  I JNT )  50,50,45 

I  SN 

0031 

45 

P(K)=A( IJNT) 

I  SN 

0032 

50 

CONTINUF 

ISN 

0033 

IF! JT— M I  55,60,60 

I  SN 

0034 

55 

CONTINUF 

ISN 

0035 

60 

IF! ISP-1 )  65 ,65,70 

ISN 

0036 

65 

WRITE  16,4)  L, (B< JW) , JW»1,KK) 

I  SN 

0037 

GO  TO  75 

I  SN 

0038 

70 

WR I T  E I  6 , 5 )  L ,(B( JW) , JW*1,KK) 

ISN 

0039 

75 

IF(N-L)  85,85,80 

ISM 

0040 

80 

CONTINUE 

I  SN 

0041 

LSTPT=LSTRT+LEND 

I  SN 

004  2 

GO  TO  20 

I  SN 

0043 

86 

IFIJT-MI  90,95,95 

I  SN 

0044 

90 

J  =  J  T  +  1 

I  SN 

0045 

GO  TO  10 

I  SN 

0046 

05 

RETURN 

I  SN 

004  7 

END 

******  END  OF  COMPILATION  ****** 

LFVEL  16  I  l  JULY  6B I  OS/360  FORTRAN  H  DATE  69.172/05.04.54 

COMPILFR  OPTIONS  -  NAM  F  =  M A I N , OPT  =  02 , L I NECNT  =  59 , SOURCE , E BCD  I C , NOL I  ST , NODECK , LOAD, NOMA P , NOEO I T ,  ID, NOXRF F 
ISN  0002  SUBROUTINE  DAT  A ( N , X ) 

tSN  0003  DIMENSION  XI  1) 

ISN  0004  READ  (2)  (XI  I  I • I  *  1 , N ) 


. 

. 


— 


' 


ISN  0005 
! SN  0006 


RETURN 

FND 
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******  end  of  COMPILATION  ****** 
LEVEL  16  (  l  JULY  60) 


0S/360  FORTRAN  H 


DATE  60. 172/05. 04. 58 


ISN  0002 


COMP  I  I  FR  OPTIONS  -  NAME  =  MA  IN  ,  OPT=02  ,  L  I  NECNT=59 ,  SOURC  E ,  EBCD  I  C  .  NOL  I  ST  ,  NOOECK  ,  L  OAD ,  NOMAP  ,  NOFD  I  T  ,  ID ,  NC1XRI  F 
SUPROOTINF  RAOFTISA  , SE , SH , LL . MM , NP , NOF , DET , AL AMO A , NR , N T . F 1 , F2 , FR , 

1  PRO  I 

C  PUPOSE  PERFORMS  RAOS  APPROXMATE  F  TEST 


c 

SA 

D.P.  MATRIX  FOR  HYPOTHESIS 

c 

SE 

O.P.  MATRIX  FOR  ERROR 

c 

SH 

WORKING  VECTOR  OF  LENGTH  NP*NP 

c 

LL  ,  MM 

WORKING  VECTOR  OF  LENGTH  NP 

c 

NP 

SIZE  OF  MATRIX  SA,$E 

c 

NDF 

RANK  OF  C  MATRIX  NG- l 

c 

DFT 

INPUT  DETERMINANT  OF  SE  MATRIX 

c 

ALAMDA 

OUPUT  WILKS  LAMOA 

c 

NR 

RANK  OF  DESIGN  MATRIX  OR  NG 

r. 

NT 

TOTAL  NO  OF  OBSERVATIONS 

c 

FI 

OUTPUT  DF 1 

c 

F  2 

OUTPUT  DF2 

c 

FR 

OUT  PUT  FRATIO 

c 

PRO 

OUT  PUT  PROBABILITY 

ISN 

noo3 

DIMENSION  SA I 1) ,SHI 1 ),SE(1 ),LL( ll.MMI 1) 

I  SN 

0004 

CALL  GMADDISA.SE, SH.NP.NPI 

I  SN 

0005 

CALL  MINVISH.NP, DA, LL.MMI 

I  SN 

0006 

alamda=det/da 

I  SN 

0007 

NS=MINO(NP,NDF) 

ISN 

0008 

IFINS.EQ.I)  GO  TO  10 

I  SN 

0010 

IFINS.E0.2)  GO  TO  11 

ISN 

0012 

S=  SORT  I  ( (NP**2)*INDF**2 1-4.01/1 NP**2*NDF**2-5. 01 ) 

I  SN 

0013 

F 1=NP*NDF 

ISN 

0014 

FM-NT-NR- (NP-N0F+ 1 .0 ) /2 .0 

I  SN 

0015 

TT=(NP*N0F-2.0I /2.0 

ISN 

0016 

F2=Em*S-TT 

I  SN 

0017 

AL=ALAMDA**( l.O/S) 

ISN 

0018 

GO  TO  20 

ISN 

0019 

10 

F 1 = 1 .0* I ARS I NDF-NPI 

ISN 

0020 

F2= 1.0+NT-NR-NP 

ISN 

0021 

AL=ALAMDA 

I  SN 

0022 

GO  TO  20 

I  SN 

0023 

1  1 

F1=4.0+2.0*IABS(NDF-NPI 

I  SN 

0024 

F2=2.0+2.0*( NT-NR-NP) 

ISN 

0025 

AL  =  SQRT I ALAMDA) 

ISN 

0026 

20 

FR  =  t  I 1.0-AL  )  *F2) /( AL*Fl I 

I  SN 

0027 

PR0=FISHER(F1,F2,FR) 

I  SN 

0028 

RETURN 

ISN 

0029 

END 

****** 

END  OF 

COMPILATION  ****** 

LEVEL  16  I  1  JULY  68) 


OS/360  FORTRAN  H 


OATE  69.172/05.05.04 


ISN  0002 


ISN  0003 
ISN  0004 
ISN  0005 
ISN  0006 
ISN  0007 
ISN  0008 
ISN  0010 
I SN  0011 
ISN  0012 
ISN  0013 
ISN  0014 
ISN  0015 


COMPILER  OPTIONS  -  NAME*  MA I N , OPT =02 . L I NECNT=59 , SOURC E , E8CD I C , NOL I  ST , NODECK . LO AD, NOMA P , NOEO I T ,  10, NOXREF 
SUBROUTINE  ROY TE S < SE , SA , SH , XL A , NOF , NP , NR , NT , NS , FM , FN , AMAX > 

C  COMPUTE  EIGENVALUES  OF  <SA)*(SE-1) 

C  SE  INPUT  MATRIX 

C  SA  INPUT  MATRIX 

C  SH  OUT  PUT  EIGENVECTORS 

C  XL  A  OUTPUT  EIGENVALUES  VECTOR 

C  NDF  INPUT  DF 

C  NP  INPUT  NU  OF  VARIABLES 

C  NR  THE  RANK  OF  DESIGN  MATRIX 

C  NT  TOTAL  NO  OF  OBSERVATION 

C  NS  FM  FN  AMAX  PARAMETERS  FOR  MAX  ROOT  TEST 

DIMENSION  SE ( 1 )  ,SA( l ) , SHI  1 ) , XL  A ( 1 ) 

CALL  NROOT(NP,SA,SE,XLA,SH) 

CALL  MXOUTI XLA, 1 ,NP, 0,60, 132,1. 12, 12H  EIGENVALUES) 

AMAX=— 999999.0 
DO  1  1=1, NP 

1  IFIXLAl I I.GT.AMAX)  AMAX=XLA(II 
AMAX=AMAX/( 1 .O+AMAX) 

NS  =  M I  NO  I NOF , NP ) 

FM=< IABSINOF-NPI-1.0I/2.0 
FN=(NT-NR-NP-1.0) /2.0 
RETURN 
END 


******  END  OF  COMPIl ATI  ON  ****** 

LEVEL  16  (  1  JULY  68)  OS/360  FORTRAN  H  DATE  69.172/05.05.09 

COMPILER  OPTIONS  -  NAM  E=  MA  I  N  ,  0PT  =  02  ,  L  I  NECNT  =  59  ,  SOURC  F  ,  E  BCD  I  C  ,  NOL  I  ST  ,  NODECK  ,  LOAD ,  N'J  MAP ,  NOED  I  T  ,  ID ,  NOXREF 


I  SN 

0002 

C 

FUNCTION  CHIPRBICHI.NDF) 

COMPUTES  PROBABILITY  OF  CHISQUARE  CHI  WITH  NDF  DEGREES  OF  FREEDOM 

I  SN 

0003 

EXTERNAL  ERF , SORT 

ISN 

0004 

REAL  NORMAL 

I  SN 

0005 

INTEGER  F 

I  SN 

0006 

LOGICAL  BIGX.EVEN 

ISN 

0007 

C 

NORMAL ( X) =0.5*1 1 .0*FRF I  0. 7071 06  8* X )  1 

SFF  KENNEY  AND  KEE P I NG ( 1 95 1 )  VOL. 2,  P.43,  WHERE  ERF  IS  CALLED  G. 

I  SN 

0008 

F  =  N0F 

I  SN 

0009 

X  =  CH  I 

ISN 

0010 

CHIPRB=t.O 

I  SN 

0011 

C 

IFIX.LE.O..OR.F.LT.1)  RETURN 

LABEL  *  WRONG*  IS  OMITTED  IN  FAVOR  OF  RETURNING  CHIPRB=1.0 

I  SN 

0013 

A=0.5*X 

I  SN 

0014 

c 

BIGX=A.GT.10. 

BIGX  SHOULD  BE  TRUE  WHEN  X  IS  SO  BIG  THAT  EXPI-A)  IS  NOT  ACCURATE 

I  SN 

0015 

EVFN=(2*(F/2)-F).EO.O 

ISN 

0016 

IF  1 FVFN.OR. I F.GT .2. AND. .NOT. BIGX) )  Y  =  FXP(-A  ) 

ISN 

0018 

I F I  EVEN )  S  =  Y 

I  SN 

0020 

IF  I .NOT. EVEN)  S  =  2 . 0*N0RMAL I -SORT ( X ) ) 

I  SN 

0022 

CH 1  PR  R=  S 

I  SN 

0023 

IFIF.LF.2)  RETURN 

ISN 

0025 

X=0. 5* I F-l . 0 ) 

ISN 

0076 

IF(EVEN)  Z=l.O 

I  SN 

0028 

IF! .NOT. EVEN)  Z=0.5 

. 
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ISN 

0030 

IF! .NOT.RIGX)  GO  TO  2 

ISN 

0032 

I F 1  EVEN )  E  =  0. 

I  SN 

0034 

IF ( .NDT.EVFN)  E=0. 5723649 

I  SN 

0036 

C=ALOG( A) 

ISN 

0037 

1 

E=  ALOG! Z  )  +E 

ISN 

0038 

S=FXP(C*Z-A-E)+S 

ISN 

0039 

Z=Z*l .0 

I  SN 

0040 

IF(Z.LE.X)  GO  TO  1 

ISN 

004? 

CHIPRB=S 

I  SN 

0043 

RETURN 

ISN 

0044 

2 

IF(FVEN)  F= 1 . 0 

ISN 

0046 

IF  I . NOT , E VFN )  F  =  0. 564 1 896/ SORT ( A) 

I  SN 

0048 

C  =  0. 

I  SN 

0049 

3 

E=F*A/Z 

ISN 

0050 

r.*r.*E 

I  SN 

0051 

Z  =  Z+  1 . 0 

I  SN 

005? 

IF (Z.LE.X)  GO  TO  3 

I  SN 

0054 

CHIPRR=C*Y*S 

ISN 

0055 

RETURN 

ISN 

0056 

END 

****** 

FNO  OF 

COMPILATION  ****** 

LEVCL  16  (  1  JULY  68)  OS/360  FORTRAN  H  DATE  69.172/05.05.15 


ISN  0002 


ISN  0003 
ISN  0004 

ISN  0005 
ISN  0006 
ISN  0007 
ISN  OOOR 
ISN  0009 
ISN  0010 
I SN  001  1 
ISN  0012 
ISN  0013 
ISN  0014 
ISN  0015 
ISN  0016 
ISN  0017 
ISN  0018 
ISN  0019 
ISN  0020 


COMPILER  OPTIONS  -  NAME-  MA  I  N  ,  0PT=02  ,  L  I  NECNT-59 , SOURC E , E BCD  I C . NOL I  ST , NODECK , L 3 AD , N3MAP ,  NOED I T ,  10 , NOXRE F 


SUBROUTINE 

SIGDS(AL,NV,NF,NG,NT) 

C 

PURPOSE 

TEST  SIGNIFICANCE  OF  ROOTS  HIS  THE 

SUM  OF  THE 

C 

AND  SMALLER  ROOT  IS  EQUAL  TO  0 

C 

SEE  RAO  1965  P  474 

C 

AL 

INPUT  VECTOR  OF  ROOTS 

C 

NV 

NO  OF  ORIGINAL  VARIABLES 

C 

NF 

NO  OF  ROOTS  TO  BE  TESTED,  SHOULD  BE 

LE  NG- 1 

C 

NG 

NO  OF  GROUPS 

C 

NT 

NU  OF  TOTAL  OBSERVATIONS 

DIMENSION  AL(l) 

ICO  FORMAT!  1H0,I3,'TH  ROOT='  , F9.4 ,5X , • 0F= ' ,  I5,5X,'CHI=',F10.4,5X, 

1 • PROS 40 ! L I TY  = ' , F9.6.5X, • PERC ENT* • , F 6. 3  I 
101  FORMAT!/, /,IH0, 'SIGNIFICANCE  TEST  OF  ROUTS  SEF  RAO  1965  P  474'l 
WRITFI6, 101) 

TRAC  E  =  0 .0 
DO  1  1=1, NV 
1  TR ACE =T RACE +  AL (  I  I 
on  3  K= 1 , NF 
CHl=0.0 


DO  2  J=K,NV 

2  f.HI*CH!  +  ALOG(  1 . 0/ (  1  .0*AL  (  J  ))) 
CHI=-!NT-NG-INV+NGI/2.0)*CHI 
NDF=  (  NV-K*  1  )*<NG-KI 

PRO=CH I  PRO ( CHI , NDF ) 

PEK= ( 1 0  0*  AL ( K ) ) /TRACE 

3  W  R  I  T  E  (  6  , 100)  X, AL IK) , NDF, CHI , PRO, PER 
RETURN 

END 


******  END  OF  COMPILATION  ****** 

LEVEL  16  I  1  JULY  68)  0S/360  FORTRAN  H  DATE  69.172/05.05.20 


ISN  0002 
ISN  0003 
ISN  0004 
ISN  0005 


COMPILER  OPTIONS  -  NAME=  MA  1 N ,  0PT  =  02  ,  L  I  NECNT  =  59 , SOURC E, E8CD I C , NOL I  ST , NODECK , L 3  AD , NO  MAP , NOED I T , ID ,  NOXRFF 
SUBROUTINE  0 ATR AN ( N , X ( 

DIMFNSION  X! 1) 

RETURN 

END 


******  end  OF  COMPILATION  ****** 

U  OF  A  JOB  STATISTICS  —  449  CAROS  READ  —  505  LINES  PRINTED  —  0  CARDS  PUNCHED  —  1.03  MINUTES  EXECUTION  TIME 


. 


APPENDIX  III 


LEVEL  16  (  t  JULY  681 


0S/360  FORTRAN  H 
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DATE  69.171/12.89.20 


COMPILER  OPTIONS  -  NAM  E=  MA IN , OPT=02 , L INECNT=59 , SOURCE , FBCO I C , NOL I  ST , NOOECK , L DAD, NOMAP , NOEO I T , 1 0, NOXRE F 


DIVISION  OF  EDUCATIONAL  RESEARCH  SERVICES 
UNIVERSITY  OF  ALBERTA 
MULV1 1 

PURPOSE:  CLASSIFY  SUBJECTS  BASED  ON  PROBABILITY  CALCULATED  BY 

DISCRIMINANT  ANALYSIS 


CARD  INPUT 
1  1 
2) 

3) 

A  1 
51 
6) 

7) 


6  1 
91 
101 

PARAMETERS 

NP 

NG 

NV 

NF 

I  PROB 


NID 


SUBPROGRAMS 


PROGRAMMER 


T I T  EL  E ( 20 A4 ) 

PARAMETERS! 1615 1  NP.NG, NV, NF, l PROB.NID 
FORMAT  FOR  INPUT  M ATR I CE S I 20A4 ) 

C-NGXNV  CENTROID  MATRIX  IN  ORIGINAL  SPACE 

D-NVXNV  OISPERSION  MATRIX  IN  ORIGINAL  SPACE 

A-NVXNF  TRANSFORMATION  MATRIX 

14,5,6  ARE  OUT  PUT  FROM  MULVIO  PROGRAM  1 

IF  I PROB= 1 ,  NO  OF  OBSERVATIONS  IN  EACH  GROUP  USED  FOR 

POPULATION  STUDY  11615) 

IF  IPR0B=2  INPUT  A  PRIORI  PROBABILITY  FOR  EACH  GROUP 
I 16F5.5) 

FORMAT  FOR  THE  DATAI20A4I 
INPUT  SAMPLE  DATA 
A  BLANK  CARD 

NO  OF  PFRSON  TO  BE  CLASSIFIED 
NO  OF  GROUPS 

NO  OF  VARIABLES  IN  ORIGINAL  SPACE 
NO  OF  VARIABLES1FACT0RS)  AFTER  TRANSFORMATION 
INDICATES  A  PRIORI  PROBABILITY  TYPE  FOR  APPLYING 
BAYES  RULE  TO  CALCULATE  A  POSTERIOR  PROBABILITY  WHEN 
THE  TEST  SCORES  ARE  GIVEN  FOR  EACH  INDIVIDUAL  TO  BE 
CLASSIFIED 

1-  PROPORTIONAL  TO  NO  OF  OBSERVATION  IN  EACH  GROUP 
USED  TO  OBTAIN  INPUT  MATRICES 

2- TO  BE  SPECIFIED  BY  CARD  INPUT ( 10F8.6 I 

IF  NOT  l  OR  2  ASSUME  EQUAL  PROPABILITY  FOR  ALL  GROUPS 
IF  N  I  D=  1  INDICATE  THERE  ARF  ID  NUMBERS  FOR  INPUT  DATA 
CARDI6A1) .OTHERWISE  ASSUME  NO  ID  NUMBER 
CLA  SSI , MX OUT ,CH I  PR  B 
SSP  ARRAY, CCPY.GMPRD.GMTRA.GTPRD 
K.  BAY 


1  SN 

0002 

DIMENSION  TITLE! 20 l.FMTI 20), A! 50, 251,0(50, 501, CI50, 50), CC<25, 501, 
1DD(25,25),S(50,50),P(50),PP(50),NCELL!50I,LL(50I,MM(50),SSI50,50I, 
2SSS! 50,50) .DATA! 501 

I  SN 

0003 

100 

FORMAT! 20A4) 

ISN 

0004 

101 

FORMAT! 1H1, 10X.20A4) 

I  SN 

0005 

102 

FORMAT! 1615) 

I  SN 

0005 

103 

FORMAT ( 1H0, • FORMAT  FOR  TR AN SFOR MA T I  ON , 0 1 SP E P S I  ON ,  AND  CFNTRUI D  MAT 
IRI X' 1 

I  SN 

0007 

104 

FORMAT! 1X.20A4) 

1  SN 

0008 

105 

FORMAT! IX, 'FORMAT  FUR  THE  DATA'I 

I  SN 

0009 

106 

FORMAT! 1H0,  'NO  OF  SUB J FC T S • , l 9 X , I 5 , / , 1 X , ■ NO  OF  GROUPS • , 2 1 X , [ 5 , / , 

1 1 X , ' NO  OF  VARI ABLES' , 18X, 15, / , l X, 'NO  OF  VARIABLES  IN  REDUCED  SPACE 
2* , IX, 15) 

ISN 

0010 

L  0  7 

FORMAT! IHO, 'NO  UF  OBSERVATION  IN  EACH  CELL  USFD  FOR  ORIGINAL  DATA' 
1 ) 

FORMAT ( IX, 1615) 

I  SN 

0011 

108 

I  SN 

0012 

109 

FORMAT! I6F5.5) 

I  SN 

0013 

110 

FORM  AT ( IHO ,  'NO  OF  OBSERVATION  IN  EACH  GROUP  IN  POPULATION  STUDY') 

I  SN 

0014 

1 

READI5, 100, END=9999)  TITLE 

I  SN 

00  15 

IF! TITl E( 1 ) . EU.T I TLE! 2 1 )  GO  TO  9999 

I  SN 

0017 

WKITE(6,101 )  TITLE 

I  SN 

0018 

RE  AD! 5, 102 1  NP, NG.NV , NF , I PROB, N I D 

I  SN 

0019 

WRITF(6,I06)NP,NG,NV,NF 

I  SN 

0020 

READI5, 100)  FMT 

I  SN 

0021 

WRI TEI6.103 ) 

I  SN 

0022 

WR!TE(6,104)  FMT 

I  SN 

0023 

DO  2  1=1, NG 

I  SN 

0024 

2 

READI5 , FMT )  ( C( I , J ) , J= 1 , NV ) 

ISN 

0025 

CALL  ARRAYI2, NG.NV, 50,50, C,C) 

I  SN 

0026 

CALL  MXOUTIC, NG.NV, 0,60, 132, 1,40, 40H  INPUT  CENTROID  MATRIX  GROUPS 

1  BY  VARS  ) 

ISN 

0027 

DO  3  1=1, NV 

I  SN 

0028 

3 

RE AD ( 5 , FMT )  ( D( I , J 1 , J= l , NV ) 

ISN 

0029 

CALL  ARRAY! 2, NV,NV, 50,50,0,0) 

I  SN 

0030 

CALL  MXOUTID, NV.NV, 0,60, 132, 1,24, 24H  INPUT  DISPERSION  MATRIX) 

I  SN 

0031 

DO  4  I = 1 , NV 

I  SN 

0032 

4 

READ(5,FMT)  ( A  1  I , J ) , J= 1 , NF ) 

I  SN 

0033 

CALL  ARRAY ( 2 ,NV ,NF , 50, 25, A , A ) 

ISN 

0034 

CALL  MX OUT ( A ,NV, NF,0 ,60, 132, 1, 28.28H  INPUT  TR ANSFORM AT  I  ON  MA T R I X ) 

I  SN 

0035 

CALL  GMPROIC ,A, S,NG,NV,NF> 

I  SN 

0036 

CALL  GMTRAtS.CC.NG.NF) 

I  SN 

0037 

CALL  MX OUT ( CC  , NF ,NG , 0 , 60 , 1 3 2 , 1 , 52 , 52H  CENTROID  MATRIX  IN  REDUCED 
1SPACE  FACTOR  BY  GROUPS  ) 

ISN 

0038 

CALL  GTPRD! A,D,S,NV,NF,NV) 

I  SN 

0039 

CALL  GMPRD! S , A , DD , NF , NV , NF ) 

I  SN 

0040 

CALL  MXOUTIDD.NF.NF, 0,60, 132,1, 36, 36H  OISPERSION  MATRIX  IN  REDUCED 
I  SPACE  ) 

I  SN 

004  l 

IF  ( IPROB.NF. 1 1  GO  TO  11 

ISN 

0043 

READI5, 10?)  (NCELL ( I 1 , 1=1 ,NG) 

I  SN 

0044 

WRITE(6,110) 

I  SN 

0045 

WRITE(6,10R)  (NCELL! I 1 , 1=1, NG) 

l  SN 

0046 

FCELL=0.0 

I  SN 

0047 

DO  5  1=1, NG 

I  SN 

0048 

5 

FCELL=FCELL+NCELLI I ) 

I  SN 

0049 

DU  6  1=1, NG 

I  SN 

0050 

6 

PI  I ) =NCELL ( I  l/FCELL 

I  SN 

0051 

GO  TO  10 

I  SN 

0052 

1  1 

CONT INUF 

ISN 

0053 

7 

IF! IPR0B.NE.2)  GO  TO  9 

I  SN 

0055 

REAOI 5, 109)  IP! I ) , 1=1 ,NG) 

I  SN 

0056 

GO  TO  10 

I  SN 

0057 

9 

PPP= 1 . 0 /NG 

I  SN 

0058 

DU  8  I = 1 ,NG 

I  SN 

0059 

8 

P ( I ) =PPP 

I  SN 

0060 

10 

CALL  MXOUTtP, 1 ,NG, 0,60, 132, 1,24, 24H  PRIORI  PROBABILITY  USED) 

I  SN 

006  1 

READ! 5, 100)  FMT 

■ 


' 
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ISN  0062  WRITE  16, 105) 

ISN  0063  WRITEI6.104)  EMT 

ISN  0064  CALL  M I NV< DO , NF , DET , LL . MM ) 

ISN  0065  CALL  MXOUT ( DO, NF .NF , 0 , 60 , 13 2, l , 32 , 32H  INVERSE  OF  DISPERSION  MATRIX 

l  ) 

I SN  0066  WRITFI6, 101) 

ISN  0067  CALL  CL  ASS M NP , NG ,NV , NF , NCELL . P . A, CC , EMT , DO , PP , S , SS , SSS , DA TA . N I D ) 

ISN  0060  r.O  TO  1 

ISN  006R  9999  STOP 

ISN  0070  END 


******  END  OF  COMPILATION  ****** 
LEVEL  16  I  I  JULY  6R) 


0S/360  FORTRAN  H 


OATF  60. 171/12. 50. 30 


COMPILER  OPTIONS  -  NAME®  M A I N ,0P T®02 , L I NECNT*59 , SOURCE . FBCD I C . NOL I S T , NOOECK . L OAO, NOMAP , NOED I T , I D , NOXRF F 


ISN  0002 


SUBROUTINE  CLASSI(NP,NG,NV,NF,NCELL,P,FL,CENT,FMT,D,PP,CH!,XX,GM, 
1DD.NIDI 
PURPOSE  : 

CLASSIFY  SUBJECTS  BASED  ON  NV  OBSERVATION  AND  POPULATION 
CENTROID,  DISPERSION  IN  REDUCED  SPACE  AND  TRANSFORMATION 
MATRICFS 

NP  NO  OF  PERSONS  TO  BE  CLLASIFIEO 

NG  NO  OF  GROUPS 

NV  NO  OF  INPUT  VARIABLES  . 

NF  NO  OF  VARIABLES  IN  REDUCED  SPACE,  USUALLY  NG-1 

P  INPUT  VECTOR  INDICATING  A  PRIORI  PROBABILITY  FOR  EACH 

GROUP 

FL  INPUT  TRANSFORMATION  MATRIX  OF  SIZE  NVXNF 

CENT  INPUT  NFXNG  CENTROID  MATRIX 

EMT  FORMAT  FOR  THE  DATA 

D  INVERSE  OF  DISPERSION  MATRIX  IN  REDUCED  SPACE 

LL , MM  WORKING  VECTORS 

DD,  CHI  WORKING  VECTORS  OF  SUE  NV  AND  NG 

NIL)  IF  EO  TO  l  ID  NO  IS  INPUT  FOR  EACH  SUBJECT, 

OTHERWISE  ASSUME  NO  ID  INPUT 


I  SN 

0003 

DIMENSION  NCELL ( 1 ) ,P( 1 1 ,PP( 1 ), FLI 1) , CENT ( l ) ,FMT(  1 )  ,D(  1 )  ,D0(  1) 

1 XXI 1 1 , GM 1 1 ) , CH I ( 1 ) 

I  SN 

0004 

DIMENSION  I D 1 6 ) 

I  SN 

0005 

DATA  IU/•*•,,*,.,*,,'*,., ♦','*'/ 

I  SN 

0006 

100 

FORMAT  1 IX.BX, 'CHISQ' , 8X.L0I IX.F8.4) , /,  l X, 2 1 X , 1 0 1  1 X , FS . 4 ) ) 

I  SN 

0007 

101 

FORMAT  I IX,  8X  ,  'POSTERIOR  P  •  ,  2X  ,  101  l  X  ,  F8 . 4 )  ,  /  ,  2(  1  X ,  2  IX  ,  1 0 1  IX,  F8 
1/  ,  )  ) 

I  SN 

0008 

102 

FORMAT  I IH0.6A1 , 2X, • SCORE •,8X,10(IX,F8.4)) 

I  SN 

0009 

103 

FORMAT ( 1X,8X, 'CONDITION  P',2X,10(1X,F8.4),/,?IIX,21X,10IIX,F8 
1/,  )  1 

I  SN 

0010 

104 

FORMAT! IX, ' . CL  ASS  I F I  CAT  I  ON . . . . . • , 5X , • COND I T ION AL : '  ,  2X ,  1 2  , 

I  SN 

0011 

DO  11  1*1, NP 

I  SN 

0012 

IF  (NID.EQ. 1)  GO  TO  1 

I  SN 

0014 

PEAU(5,FMT)  1 DD( J 1 , J= 1 , NV ) 

I  SN 

0015 

GO  TO  2 

I  SN 

0016 

i 

READI5,FMT)  (I0(K),K*1,6),(DDIJI,J*1,NV) 

ISN 

0017 

2 

CALL  GMPRDIDD.FL, XX, 1 ,NV,NF ) 

ISN 

0018 

WRI TE (6, 102 )  I IDIK I ,K»1 ,6) , 1 XXI J) , J=l ,NF ) 

I  SN 

0010 

DO  5  M® 1 , NG 

I  SN 

0020 

CAIL  CCPY(CENT,M,GM,NF,NG,0) 

I  SN 

002  1 

DO  3  L=  1 ,  NF 

I  SN 

0022 

3 

GM I L ) * XX  I L )-GM( L 1 

I  SN 

0023 

CALL  GMPRD 1 0 , GM , DD, NF , NF , 1 ) 

I  SN 

0024 

X2*0 .0 

I  SN 

0025 

DO  4  J* 1 , NF 

ISN 

0026 

4 

X2*  X2  *GM I J ) *DD( J ) 

I  SN 

0027 

5 

CHI (M)*X2 

I  SN 

002B 

SUM*0. 0 

I  SN 

0020 

DO  6  J® l , NG 

I  SN 

0030 

PPI  J)*P( J)*EXP(-CHI I Jl/2.0) 

I  SN 

0031 

6 

SUM®  SUM  +  PP I J ) 

ISN 

0032 

DO  7  J® 1 , NG 

I  SN 

0033 

7 

PP I J ) *P  P l J ) / SUM 

ISN 

0034 

WRITEI6.100)  (CHI  I J) ,J=1,NG) 

I  SN 

0035 

WRITE(6,101)  1 PPI J) , J*1 ,NG) 

ISN 

0036 

NCB®  1 

1  SN 

0037 

DO  8  J*2 ,NG 

1  SN 

0038 

8 

IF l PPI J  1  .GT .PPI NCB) )  NCB* J 

1  SN 

0040 

DO  9  J  *  1 , N  G 

I  SN 

0041 

9 

PPI J ) *CH I PRR (CHI ( J) ,NF) 

I  SN 

0042 

WRI TE I  6 , 103 )  (PPI J)  ,J*1,NG) 

I  SN 

0043 

NCA*  1 

I  SN 

0044 

DO  10  J* 2 , NG 

I  SN 

0045 

10 

IFIPPI JI.GT.PPINCA) )  NCA® J 

I  SN 

0047 

WRITEI6.104)  NCA, NCB 

I  SN 

0048 

1 1 

CONTINUE 

ISN 

0040 

RETURN 

I  SN 

0050 

END 

♦* 

END  OF 

COMPILATION  ****** 

LEVFL 

16  I  1 

JULY  68) 

OS/360  FORTRAN  H 

DATE  69.171/12.59.55 


COMPILER  OPTIONS  -  NAME®  MAIN»OPT*02, LINECNT*59, SOUR CE.ERCDIC, NOLIST, NODECK, LOAD, NO MAP, NO EDIT, ID.NOXREF 
ISN  0002  SUBROUTINE  MXOU T I  A , N , M , MS , L I  NS , I POS , I SP , NUMHOL , T I TLE I 

C 

C  A  : NAME  UF  OUTPUT  MATRIX 

c  N  : NUMB  EK  OF  ROWS  IN  A 

C  M  : NUMBER  OF  COLUMNS  IN  A 

C  MS  : STORAGE  MODE  OF  A 

C  0  GENERAL 

C  1  SYMETRIC 

C  2  DIAGONAL 

C  LINS  : NUMB  ER  OF  PRINT  LINES  ON  THE  PAGE ( USUALLY  60 1 

C  IPOS  : NUMB  ER  OF  PRINT  POSITIONS  ACROSS  THE  PAGE  I  USUALLY  132) 

C  ISP  :L  INE  SPACING  CODE,  l  FOR  SINGLE  SPACE  2  FOR  DOUBLE 

C  SPACF 


,  71 3X, I  3, 10X)  ) 


I3.7IE16.6)  ) 


I  SN 

0003 

DIMENSION  A( l ) , B<  81 

1  SN 

0004 

1 

FORMAT  I  1  HO , / , 1H0 , 

I  SN 

0005 

2 

FORMAT! 17X, SHCOLUMN 

I  SN 

0006 

3 

FORMAT! 1H  1 

I  SN 

0007 

4 

FORMAT  I 1H  , 7X , 4HR0W 

I  SN 

0008 

5 

FORMAT! | HO , 7X.4HR0W 

-  . 
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I  SN 

0009 

NN»( NUMHOL+31/4 

ISN 

0010 

WR 1 TE (6  *  1 )  ITITLE(J),J*l,NN) 

1  SN 

0011 

J*1 

ISN 

0012 

NEND=lP0S/16-l 

ISN 

0013 

LEND=(L!NS/ISP)-2 

ISN 

0014 

10 

LSTRT* 1 

I  SN 

0015 

20 

CONTINUE 

ISN 

0016 

JNT* J+NEND— 1 

I  SN 

0017 

31 

IF! JNT-M)33,33,32 

ISN 

0018 

32 

JNT  =  M 

I  SN 

0019 

33 

CONTINUE 

I  SN 

0020 

WRITE (6*2)  I JCUR  *  JCUR- J  t  JNT ) 

I  SN 

0021 

IFIISP-l)  35,35,40 

I  SN 

0022 

35 

WR I TF I  6 , 3 ) 

I  SN 

0023 

40 

LTEND=LSTRT*LEND-l 

ISN 

0024 

DO  80  L=LSTRT ,LTENO 

ISN 

0025 

DO  55  K= 1 , NEND 

I  SN 

0026 

KK=K 

I  SN 

0027 

JT-J+K-1 

I  SN 

0028 

CALL  LOCIL , JT, I JNT,N,M,MS) 

ISN 

0029 

B<KI=0.0 

I  SN 

0030 

IFIIJNT)  50,50,45 

I  SN 

0031 

45 

B(K)=A(  IJNT) 

ISN 

0032 

50 

CONTINUE 

I  SN 

0033 

IFIJT-MI  55,60,60 

1  SN 

0034 

55 

CONTINUE 

I  SN 

0035 

60 

IFIISP-l)  65,65,70 

ISN 

0036 

65 

WR I TE I  6 , 4 )  L,( 81 JW) , JW-l.KK) 

ISN 

0037 

GO  TO  75 

I  SN 

0038 

70 

WRITE (6 ,5  )  L,  (  8(  JW) , JW  =  1 ,KK) 

ISN 

0039 

75 

IFIN-L)  85,85,80 

ISN 

0040 

80 

CONTINUE 

ISN 

0041 

LSTRT=LSTRT+LEND 

I  SN 

0042 

GO  TO  20 

I  SN 

0043 

85 

IFIJT-M)  90,95,95 

I  SN 

0044 

90 

J=JT*1 

I  SN 

0045 

GO  TO  10 

I  SN 

0046 

95 

RETURN 

ISN 

0047 

END 

****** 

END  OF 

COMPILATION  ****** 

LEVEL 

16  I  1 

JULY 

68) 

OS/360  FORTRAN  H 

COMPILER  OPTIONS  -  NAME=  MA IN , 0PT=02 , L INECNT  =  59 , SOURC E, EBCD IC , NOL I  ST , NODECK 

ISN 

0002 

FUNCTION  CHIPRBICHI ,NOF) 

c 

COMPUTES  PROBABILITY  OF  CHISOUARE  CHI  WITH  NDF  DEGREES 

OF  FRFEDOM 

I  SN 

0003 

EXTERNAL  SQRT.ERF 

ISN 

0004 

REAL  NORMAL 

ISN 

0005 

INTEGER  F 

I  SN 

0006 

LOGICAL  BIGX.EVEN 

ISN 

0007 

NORMAL  (  X)  =  0. 5*1  1  .O  +  ERF I  0.7071068* (-SORT  I  XI  )  )  ) 

c 

SEE  KENNEY  AND  KEEP INGI 1951 1  VOL. 2,  P.43,  WHERE  ERF  IS 

CALLED  G. 

I  SN 

0008 

F-=NDF 

I  SN 

0009 

X=CHI 

I  SN 

0010 

CH I PRB= 1 .0 

ISN 

0011 

IF ( X.LE.0..0R.F.LT.1 1  RETURN 

c 

LABEL  *WRONG*  IS  OMITTED  IN  FAVOR  OF  RETURNING  CHIPRB* 

1.0 

I  SN 

0013 

A=0.5*X 

I  SN 

0014 

BIGX=A.GT. 10. 

c 

BIGX  SHOULD  BE  TRUE  WHEN  X  IS  SO  BIG  THAT  EXP(-A)  IS  NOT  ACCURATE 

I  SN 

0015 

EVEN= (2*1 F/2 l-F ) .EO.O 

I  SN 

0016 

I  FI EVEN.OR.IF.GT.2.ANO..NOT.BIGX) 1  Y=EXP(-A) 

I  SN 

0018 

IF(EVEN)  S=Y 

I  SN 

0020 

I  FI .NOT. EVEN)  S=2. 0*NORMAL 1 X ) 

I  SN 

0022 

CH  I  PRB=  S 

I  SN 

0023 

IFIF.LE.2)  RETURN 

ISN 

0025 

X=0. 5* I F- l . 0 ) 

ISN 

0026 

I F ( EVEN )  1=1.0 

I  SN 

0028 

I F  <  .NOT. EVEN)  Z=0.5 

I  SN 

0030 

IF( .NOT. BIGX)  GO  TO  2 

ISN 

0032 

IF(EVEN)  F=0 • 

I  SN 

0034 

IF! .NOT. EVEN)  E=0. 5723649 

I  SN 

0036 

C-ALOGI A) 

1  SN 

0037 

1 

E=ALOG( Z 1 ♦ E 

ISN 

0038 

S=EXP(C*Z-A-E)*S 

I  SN 

0039 

Z=Z+1.0 

I  SN 

0040 

IFIZ.LF.X)  GO  TO  1 

I  SN 

0042 

CHI PRB=S 

ISN 

0043 

RETURN 

I  SN 

0044 

2 

IFIEVEN)  E= 1 . 0 

I  SN 

0046 

IFI .NOT. EVEN)  E  =  0. 564 1  896/SORT  I  A) 

I  SN 

0048 

C  =  0. 

I  SN 

0049 

3 

E=E*A/Z 

I  SN 

0050 

C  =  C*E 

1  SN 

0051 

1=1* l .0 

ISN 

0052 

IFIZ.LE.XI  GO  TO  3 

ISN 

0054 

CHI PRB=C*Y*S 

I  SN 

0055 

RETURN 

1  SN 

0056 

END 

****** 

END  OF 

COMPILATION  ****** 

U  (IF  A 

JOB 

STATISTICS  —  291  CARDS  READ  —  330  LINES  PRINTED  — 

0  CARDS 

DATE  69.171/13.00.05 


0  CARDS  PUNCHED  —  0.53  MINUTES  EXECUTION  TIME 


-t  > 
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LEVEL  lb  (  1  JULY  68) 


OS/360  FORTRAN  H 


DATE  69.171/12.59.23 


COMPILER  OPTIONS  -  NAME*  M A  IN , OPT =02 , L I NECNT=59 , SOURC E , E BCD IC , NOL I  ST , NODECK , LOAD, NOMAP , NOEO I T , 


ISN  0002 


ISN  0003 
ISN  0004 
ISN  0005 
ISN  0006 

ISN  0007 
ISN  0008 
ISN  0009 


ISN  0010 


DIVISION  OF  EDUCATIONAL  RESEARCH  SERVICES 
UNIVERSITY  OF  ALBERTA 
MULV12 

PURPOSE:  CALCULATE  DISCRIMINANT  SCORES  AND  CLASSIFY  SUBJECTS 

BASED  ON  CONDITIONAL  PROBABILITY  AND  BAYES  RULE 
USING  EACH  GROUP  DISPERSION  MATRIX 


CARD 


INPUT 
1  I 
2) 


ALL  GROUPS 
INPUT  DATA 


TITELEI20A4) 

PARAMETERS! 1615)  NP ,NG , NV , NF , I PROB , N I D 
NP  NO  OF  PERSON  TO  BE  CLASSIFIED 

NG  NO  OF  GROUPS 

NV  NO  OF  VARIABLES  IN  ORIGINAL  SPACE 

NF  NO  OF  VARIABLESIFACTORS)  AFTER  TRANSFORMATION 

IPROB  INDICATES  A  PRIORI  PROBABILITY  TYPE  FOR  APPLYING 

BAYES  RULE  TO  CALCULATE  A  POSTERIOR  PROBABILITY  WHEN 
THE  TEST  SCORES  ARE  GIVEN  FOR  EACH  INDIVIDUAL  TO  BE 
CLASSIFIED 

1- PROPORTIONAL  TO  NO  OF  OBSERVATION  IN  EACH  GROUP 
USED  TO  OBTAIN  INPUT  MATRICES 

2- TO  BE  SPECIFIED  BY  CARD  INPUT 
IF  NOT  I  OR  2  ASSUME  EQUAL  PROPAB I L I TY  FOR 

NID  IF  NI 0- 1  INDICATE  THERE  ARE  ID  NUMBERS  FOR 
CARDI6AI) .OTHERWISE  ASSUME  NO  ID  NUMBER 
FORMAT  FOR  INPUT  MATR I CES I 20A4 I 
C-NGXNV  CENTROID  MATRIX  IN  ORIGINAL  SPACE 
D-NVXNV  DISPERSION  MATRIX  IN  ORIGINAL  SPACE 
A-NVXNF  TRANSFORMATION  MATRIX 
NG  GROUP  DISPERSION  MATRIX  OF  SIZE  NVXNV 
(4,5,7  ARE  OUT  PUT  FROM  MULV10  PROGRAM) 

IF  I PR0B=1 ,  NO  OF  OBSERVATIONS  IN  EACH  GROUP  USED  FOR 
POPULATION  STUOY  (L6I5) 

IF  IPR0B=2  INPUT  A  PRIORI  PROBABILITY  FOR  EACH  GROUP 
( I6F5.5) 

FORMAT  FOR  THE  DAT  A ( 20A4 ) 

INPUT  SAMPLE  OATA 
A  BLANK  CARD 
CLASSI ,MXOUT,CHIPRB 

SSP  ARRAY,CCPY,GMPRD,GMTRA,GTPRO,MINV,MPRD,MSTR,LOC 
K.  BAY 

II  CURRENTLY  DIMENSIONED  TO  ACCOMOOATF  UP  TO  50 
VARIABLES, 25  GROUPS, ANO  24  DISCRIMINANT  SCORES 
DIMENSION  TITLE(20I,FMT(20),A(50,25),D(50,50),C(50,50),CC(25,50) , 
10DI  25,2  5) ,S<  2500  ) ,P I  50  I , PPI 50  I ,NCELL ( 50 ) , LL I  50 1 , MM( 50 1 , SS I  50 , 50 ) , 
2SSSI 50,50) .DATA  150) , DG I  300,25 1  , DPI  50) 

100  FORMAT  I 20A4I 

101  FORMATI 1H1 , 10X.20A4) 

102  FORMATI 1615) 

103  FORMATI 1H0, ‘FORMAT  FOR  TRANSFOR MAT  I  ON , DI SPFR S ION,  AND  CENTROID  MAT 
IRIX') 

104  FORMATI 1X.20A4) 

105  F0RMATI1X, ‘FORMAT  FOR  THE  0 AT4 «  ) 

106  FORMATI  IHO ,  ‘NO  OF  SUB JECTS' , 1 9X , I  5 , / , 1 X , • NO  OF  GROUPS • , 2 1 X , I  5 , / , 

1 1 X , ' NO  OF  VARIABLES* , 18X, 15, /, IX, 'NO  OF  VARIABLES  IN  REDUCED  SPACE 
2  ■  ,  1  X  ,  1 5  ) 

107  FORMATI IHO, 'NO  OF  OBSERVATION  IN  EACH  CELL  USED  FOR  ORIGINAL  DATA* 
1) 


3) 

4) 
51 
6) 

7) 

8) 


91 

10) 

11) 

SUBPROGRAMS 

PROGRAMMER 

REMARKS 


ISN  0011 
ISN  0012 
ISN  0013 
ISN  0014 
ISN  0015 
ISM  0016 
ISN  0017 
ISN  0019 
ISN  0020 
ISN  0021 
ISN  0022 
ISN  0023 
ISN  0024 
ISN  0025 
ISN  0026 
ISN  0027 
ISN  0028 

ISN  0029 
ISN  0030 
ISN  0031 
ISN  0032 
ISN  0033 
ISN  0034 
ISN  0035 
ISN  0036 
ISN  0037 
ISN  0038 
ISN  0039 

ISN  0040 
ISN  0041 
ISN  0042 

ISN  0043 
ISN  0044 

ISN  0045 
ISN  0046 
ISN  0047 
ISN  0048 
ISN  0049 
I SN  0050 
ISN  0051 
ISN  0052 
ISN  0053 
ISN  0054 
ISN  0055 


108  FORMATI IX, 1615) 

109  FORMAT! 16F5.5) 

110  FORMATI IHO, 'NO  OF  OBSERVATION  IN  EACH  GROUP  IN  POPULATION  STUDY') 

111  FORMAT! /,/, IHO, 'GROUP  DISPERSIONS') 

112  FORMATI IHO, 'GROUP  S • , I  3 ) 

1  READI5, 100, END* 9999)  TITLE 
IF(TITLE(l).EQ.TITLEI2l)  GO  TO  9999 
WRITEI6,101)  TITLE 

RE  AD  I  5, 102)  NP,NG,NV,NF,IPROB,NID 
WRITE(6,106)NP,NG,NV,NF 
READI5, 100)  FMT 
WR  I  TF I  6 , 103 ) 

WR I TE I  6 , 104 )  FMT 
DO  2  I = 1 , NG 

2  RE  AD! 5 , FMT )  I C<  I , J ) , J* 1 , NV ) 

CALL  ARRAY! 2, NG.NV, 50,50, C,C> 

CALL  MXOUT!C, NG,NV, 0,60, 132, 1,40, 40H  INPUT  CENTROID  MATRIX  GROUPS 
18Y  VARS  ) 

DO  3  I = l , NV 

3  RE  A  D ( 5 , FMT  )  ID!  I , J ) , J* 1 , NV ) 

CALL  ARRAY! 2,NV ,NV, 50,50,0, D) 

CALL  MXOUT! D.NV.NV, 0,60, 132, 1,24, 24H  INPUT  DISPERSION  MATRIX) 

DO  4  1*1, NV 

4  REA0I5,FMT)  I  A  (  I , J I , J* l , NF I 
CALL  ARRAY(2,NV,NF,50,25,A,A) 

CALL  MX0UT(A,NV,NF,0,60, 132, 1,28, 28H  INPUT  TRANSFORMATION  MATRIX) 
CALL  GMPRDIC,A,S,NG,NV,NF) 

CALL  GMTRA! S.CC.NG.NF ) 

CALL  MXOUT ( CC  , NF ,NG , 0 , 60 , 1 32 , 1 , 52 , 52H  CENTROID  MATRIX  IN  REDUCED 
1SPACE  FACTOR  BY  GROUPS  ) 

CALL  GTPRD(A,D,S,NV,NF,NV) 

CALL  GMPRD(S,A,OD,NF,NV,NFI 

CALL  MXOUT!DD,NF,NF ,0,60, 132, 1, 36.36H  DISPERSION  MATRIX  IN  REDUCED 
1  SPACE  I 

CALL  MI NV! OD.NF ,DET,LL, MM) 

CALL  MXOUT! DD.NF.NF ,0,60, 132, 1, 32, 32H  INVERSE  OF  DISPERSION  MATRIX 
1  ) 

NFH* I (NF+l)*NF)/2 
WRITEI6.111) 

DO  14  I G= 1 , NG 
WRITEI6, 1121  IG 
DO  12  I  *  1 »  NV 

12  RE  AD ( 5 , FMT )  I D ! 1 , J ) , J= 1 , NV ) 

CALL  ARRAYI2.NV.NV, 50,50,0,0) 

CALL  MXOUT(D,NV,NV ,0,60, 132, 1,20, 20H  IN  ORIGINAL  SPACE  ) 

CALL  GTPRD(A,D,S,NV,NF,NV) 

CALL  GMPRD(S,A,D,NF,NV,NF ) 

CALL  MXOUTIO, NF.NF, 0,60, 132, 1,20, 20H  IN  REDUCED  SPACE  ) 


ID, NOXREF 


. 


' 

> 


' 
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I  SN 

0056 

CALL  MINVID.NF.DET, LL.MM) 

I  SN 

0057 

DP! IG)=DET 

ISN 

0058 

CA1L  MXOUTID, NF.NF, 0,60, 132, 1,24, 24H  INVERSE  OF  DISPERSION  1 

I  SN 

0059 

CALL  MSTR(D,S.NF,0, 1) 

ISN 

0060 

DO  13  J  =  1 ,NFH 

ISN 

0061 

13 

DG(J.IG)-SIJ) 

I  SN 

0062 

14 

CDNT INUE 

I  SN 

0063 

WRITEI6, 101 1 

ISN 

0064 

CALL  ARKAY(2,NFH,NG,300,25,OG,DG) 

I  SN 

0065 

CALL  MXOUTIOP, 1 ,NG, 0,60, 132, 1,20, 20H  DETERMINANTS  ) 

I  SN 

0066 

IF  1 IPROB.NE.il  GO  TO  11 

I  SN 

0068 

READI5, 102)  (NCELLI I  I , 1  =  1 ,NG) 

I  SN 

0069 

WRITEI6, 110) 

ISN 

0070 

WRITE  16, 108 )  ! NCELL ( I ) , I  =  1 , NG) 

I  SN 

0071 

FCELL=0.0 

I  SN 

0072 

DO  5  I = 1 , NG 

ISN 

0073 

5 

FCELL=FCELL* NCELL! I  I 

I  SN 

0074 

DO  6  1=1, NG 

ISN 

0075 

6 

P(  I ) =NC  ELL < I l/FCELL 

I  SN 

0076 

GO  TO  10 

ISN 

0077 

1  l 

CONTINUE 

ISN 

0078 

7 

IF! IPR0B.NE.2)  GO  TO  9 

I  SN 

0080 

RE  ADI  5 , 109 )  IP! I ) , 1  =  1 , NG ) 

I  SN 

0081 

GO  TO  10 

ISN 

0082 

9 

PPP=1.0/NG 

I  SN 

0083 

DO  B  1=1, NG 

I  SN 

0084 

8 

P< I )=PPP 

I  SN 

0085 

10 

CALL  MXOUT ( P, l ,NG,0,60, 132, 1 ,24,24H  PRIOR!  PROBABILITY  USED) 

I  SN 

0086 

READ! 5, 100)  FMT 

I  SN 

0087 

WRITE(6,105) 

ISN 

0088 

WRITF|6,104)  FMT 

I  SN 

0089 

WRITE (6, 101) 

I  SN 

0090 

CALL  CLASSI (NP.NG.NV, NF, NCELL, P, A, CC, FMT, DD,PP,S,SS,SS S, DATA, NID 
IDG, DP) 

ISN 

009  1 

GO  TO  1 

I  SN 

0092 

9999 

STOP 

ISN 

0093 

END 

****** 

END  OF 

COMP IL AT  I  ON  ****** 

LEVEL 

16  (  1 

JULY  6B) 

OS / 360  FORTRAN  H 

DATE  69.171/12.59.37 


I  SN 


I  SN 

I  SN 
I  SN 
1  SN 
I  SN 
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I  SN 

I  SN 

1  SN 
I  SN 
I  SN 
I  SN 
1  SN 
I  SN 
ISN 
I  SN 
I  SN 
ISN 
ISN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
ISN 
I  SN 
I  SN 
I  SN 
ISN 
I  SN 
I  SN 
I  SN 
I  SN 
I  SN 
ISN 
ISN 
I  SN 
I  SN 
I  SN 
ISN 
I  SN 


0002 


0003 

0004 

0005 

0006 

0007 

0008 

0009 

0010 

0011 
0012 
0013 
0015 
0016 
0017 
001» 
0019 
0020 
002  1 
0022 
0023 
0024 
0025 
0026 
0027 
00  2  R 
0029 
0030 
0031 
0032 
0033 
0034 
0035 
0036 
0037 
003H 
0039 
0040 
0042 
0043 
0044 
0045 
0046 


COMPILER  OPTIONS  -  NAM  F  =  M A  IN , OPT =02 , L INFCNT= 59 , S OURC E ,E BCDI C , NOL I  ST , NODECK, LOAD, NOMA P , NOED I T , 
SUBROUTINE  C1ASSII NP, NG.NV.NF, NCELL, P.FL, CENT, FMT, D,PP,  CHI, XX, GM, 

IDDiN ID  ,  DG, DP  ) 

PURPOSE  : 

CLASSIFY  SUBJECTS  BASED  ON  NV  OBSERVATION  AND  POPULATION 
CENTROID.  DISPERSION  IN  REDUCED  SPACE  AND  TRANSFORMATION 


MATRICES 

NP 

NG 

NV 

NF 

P 


NO  OF  PERSONS  TO  BE  CLLASI FIFO 
NO  OF  GROUPS 
NO  OF  INPUT  VARIABLES 

NO  OF  VARIABLES  IN  REDUCED  SPACE,  USUALLY  NG-1 
INPUT  VFCTOR  INDICATING  A  PRIORI  PROBABILITY  FOR  EACH 
GROUP 

FL  INPUT  TRANSFORMATION  MATRIX  OF  SIZE  NVXNF 

CFNT  INPUT  NFXNG  CENTROID  MATRIX 

FMT  FORMAT  FOR  THE  DATA 

D  INVERSE  OF  DISPERSION  MATRIX  IN  REDUCED  SPACE 

LL.MM  WORKING  VECTORS 

DD.CHI  WORKING  VECTORS  OF  SIZE  NV  AND  NG 

NID  IF  EO  TO  1  ID  NO  IS  INPUT  FOR  EACH  SURJECT, 

OTHERWISE  ASSUME  NO  ID  INPUT 

DIMENSION  NCELL(l),Plll,PP(l),FL(l),CENT(l),FMT(l)fDIl),DD(ll, 
l XXI 1 1 ,GM( 1 | ,CHl I  1 ) ,DG< 1 ) ,0P(1» 

DIMENSION  I D l 6 1 

DATA  ID/ •*’,•*•,•*•,•*’, / 

100  FORMAT ( 1X.BX , 'CHISO' ,BX, 101 1 X.FB.4 I ,/ ,  1 X , 2 IX , 101 1 X , F8. 4 1 ) 

101  FORMAT! IX, BX, 'POSTER IOR  P>t2XfL0fiXiF8.4),/l2(lXf21Xti0(lXtF8.«)t 
1/,  )  ) 

102  FORM  AT I1H0,6A1,2X,' SCORE’, BX,10(1X,F8. 4)1 

103  FORMAT! 1 X , B X , • C ONDI T I  ON  P’,2X,10(1X,F8.4I,/,2(IX,71X,10(1X,F8.4), 

1/,  )  ) 

104  FORM  AT! IX, • . CLASSIFICATION . • , 5X , • COND I T I  ON AL : • , 2X , 1 2 , 5X , 

l’BAYSIAN  RULE:  •  ,2X,  12,  IX,  • . ') 

NFH= I INF*l)*NF)/2 
DO  11  MM= l , NP 
IF  (NID.EO. 1 )  GO  TO  1 
RLADI5.FMT)  I  DO ( J )  , J  =  1 , NV ) 

GO  TO  7 

1  READI5.FMT)  (!D(K),K=1,6I,IDD(JI,J=1,NV) 

2  CALL  GMPRD! OD.FL, XX, 1 ,NV,NF ) 

WR I TE ( 6 , 102 )  ( I D ( K ) , K= 1 , 6 1 ,1 XX! J) ,J=l,NF ) 

DO  5  M=  1 ,  NG 

CALL  CCPY(CENT,M,GM,NF,NG,0) 

CALL  CCPY! DG,M,D,NFH,NG,0» 

DO  3  L= 1 , NF 

3  GM(L)=XXI LI— GM( L 1 

CALL  MPRD(D,GM,DD,NF,NF, 1,0, 11 

X2=0.0 

DO  4  J=  1 ,  NF 

4  X2=X2*GM( J)*00I J) 

5  CHI (  M )  =  X2 


SUM=0 • 0 
00  6  J= 1 , NG 

PP(JI  =  ( PIJ)*EXP(— CHI(J) /2.01 1/1  SORT (DPIJI ) 1 

6  SUM=SUM*PP(J  1 
DO  7  J= 1 , NG 

7  PPI J)=PP( Jl/SUM 

WR l T E I  6 , 1 00 1  ICHI ( J) , J=1,NG) 

WRITE(6,101)  I PP< J) , J=l,NG) 

NCB=1 

DO  8  J=2 , NG 

8  IF (PPIJ) •  GT .PP(NCfi) 1  NCB= J 
DO  9  J= 1 , NG 

9  PP(  J ) =CH I PRB  ICHI  (J)  ,NF) 

WRITE(6,1031  I  PP< J) , J=1,NGI 
NCA=  1 

on  10  J=2,NG 


ID, NOXREF 


. 

' 


<■  \ 

> 

■ 

. 
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I S N  0047 
ISN  0049 
ISN  0060 
ISN  0061 
ISN  0052 


10  IFIPP! Jl .GT.PPINCAI )  NCA=J 
WR I TE ( 6 , 1041  NCA.NCB 

11  CONTINUE 
RETURN 
END 


******  gMQ  OF  COMPILATION  ****** 
LEVEL  16  I  1  JULY  68) 


QS/360  FORTRAN  H 


OATE  69.171/12.59.48 


ISN  0002 


COMPILER  OPTIONS  -  NAM  E=  MA I N , 0PT=02 , L I NECNT*59 , SOURC E, EBCO I C , NOL I  ST , NOOECK , LO AO, NOMAP , NOFO I T , I D, NOXRFF 
SUBROUTINE  MXOUT I  A , N , M , MS , L INS ,  I POS, I SP , NUMHOL , T I TLE ) 


A 

N 

M 

MS 


LINS 

IPOS 


: NAME  OF  OUTPUT  MATRIX 
: NUMBER  OF  ROWS  IN  A 
[NUMBER  OF  COLUMNS  IN  A 
[STORAGE  MUOE  OF  A 
0  GENERAL 

1  SYMETRIC 

2  DIAGONAL 

[NUMBER  OF  PRINT  LINES  ON  THE  PAGE  I  USUAL  LY  60) 

[NUMBER  OF  PRINT  POSITIONS  ACROSS  THE  PAGE (USUALLY  132) 


c 

ISP  :LINE  SPACING  CODE,  1  FOR  SINGLE  SPACE  2  FOR 

DOUBLE 

c 

SPACE 

I  SN 

0003 

DIMENSION  A( 1) ,B( 8) ,TITLE(20) 

ISN 

0004 

1 

FORMAT  !1H0,/,1H0,20A4) 

I  SN 

0005 

2 

FORMAT! 12X.8HC0LUMN  ,  7  <  3X  ,  l  3,  l  OX)  ) 

I  SN 

0006 

3 

FORMAT!  1H  1 

ISN 

0007 

4 

F0RMATI1H  , 7X , 4HR0W  ,  I  3 , 7( E 16. 6 1 ) 

ISN 

0008 

5 

FORMAT!  IHO , 7  X , 4HR0W  ,I3,7(E16.6)) 

I  SN 

0009 

NN=( NUMH0L*3)/4 

I  SN 

0010 

WR I TE ( 6, 1 )  ITITLEIJI ,J=l,NN) 

I  SN 

0011 

J  =  1 

I  SN 

0012 

NEND=IP0S/16-I 

1  SN 

0013 

LEND=(LINS/ISP)-2 

ISN 

0014 

10 

LSTRT= 1 

ISN 

0015 

20 

CONTINUE 

I  SN 

0016 

JNT=JfNEND-1 

I  SN 

0017 

31 

IF(JNT-M)33, 33,32 

I  SN 

0018 

32 

JNT  =  M 

ISN 

0019 

33 

CONTINUE 

I  SN 

0020 

WRITE (6,2)  (JCUR,JCUR=J,JNT) 

I  SN 

0021 

IF(lSP-l)  35,35,40 

ISN 

0022 

35 

WR I TE ( 6 , 3 ) 

I  SN 

0023 

40 

LTEND=LSTRT*LEND-1 

I  SN 

0024 

DO  80  L  =  LSTR T , LTENO 

I  SN 

0025 

DO  55  K= l , N END 

I  SN 

0026 

KK»K 

I  SN 

0027 

JT=J+K-1 

ISN 

0028 

CALL  LOCIL, JT,IJNT,N,M,MS) 

ISN 

0029 

B ( K ) =0 . 0 

ISN 

0030 

IF(IJNT)  50,50,45 

I  SN 

0031 

45 

B(K)=A( I JNT ) 

I  SN 

0032 

50 

CONTINUE 

I  SN 

0033 

IF(JT-M)  55,60,60 

I  SN 

0034 

55 

CONTINUE 

I  SN 

0035 

60 

IFIISP-1)  65,65,70 

1  SN 

0036 

65 

WRITEI6.4)  L , ! B( JW) , JW=1 , KK) 

ISN 

0037 

GO  TO  75 

1  SN 

0038 

70 

WRITEI6.5)  L,( B(JW) , JW  =  1 ,  KK ) 

ISN 

0039 

75 

IF(N-L)  85,85,80 

I  SN 

0040 

80 

CONTINUE 

I  SN 

0041 

LSTRT=LSTRT*LEND 

ISN 

0042 

GO  TO  20 

I  SN 

0043 

85 

IF(JT-M)  90,95,95 

I  SN 

0044 

90 

J - J  T  + 1 

I  SN 

0045 

GO  TO  10 

I  SN 

0046 

95 

RETURN 

I  SN 

0047 

END 

****** 

END  OF 

COMP  I L  AT  I  UN  ****** 

LFVFL  ! 

16  (  l 

JULY 

68) 

0S/360  FORTRAN  H 

COMPILER  OPTIONS  -  NAME=  M A  I N , OPT =02 , L I NECNT  =  59 , SOURC E , E BCD  I C , NOL I  ST , NOOECK 

I  SN 

0002 

FUNCTION  CHIPRBICHI ,NDF) 

c 

COMPUTES  PROBABILITY  OF  CHISQUARE  CHI  WITH  NDF  DEGREES 

OF  FREEDOM 

I  SN 

0003 

EXTERNAL  SORT, ERF 

I  SN 

0004 

REAL  NORMAL 

I  SN 

0005 

INTEGER  F 

I  SN 

0006 

LOGICAL  BIGX.FVEN 

I  SN 

0007 

NORMAL(X)=O.5*!l.0*ERF(O. 70 71068* 1 - SQR T ( X ) ) ) 1 

c 

SEE  KENNEY  AND  KEE P  INGI 195 1 )  VOL. 2,  P.43,  WHERE  ERF  IS 

CALLED  G. 

I  SN 

0008 

F=NDF 

ISN 

0009 

X  =  CH  I 

I  SN 

0010 

CHIPRB=1.0 

I  SN 

0011 

IF! X.LE.O..OR.F.LT.1 )  RETURN 

c 

LABEL  *WRONG*  IS  OMITTEO  IN  FAVOR  OF  RETURNING  CHIPRB= 

1.0 

I  SN 

0013 

A=0.5*X 

ISN 

0014 

B  I  GX  =  A.  GT .  10. 

c 

BIGX  SHOULD  BE  TRUE  WHEN  X  IS  SO  BIG  THAT  FXP(-A)  IS  NOT  ACCURATE 

I  SN 

0015 

EVEN=(2*(F/2)-F 1.EQ.0 

I  SN 

0016 

IF(EVEN.0R.(F.GT.2.AND..N0T.BIGX) 1  Y=EXPI-A) 

I  SN 

0018 

IF(EVFN)  S= Y 

I  SN 

0020 

IF! .NOT. EVEN)  S=2.0*NORMAL < X ) 

I  SN 

0022 

CHI PRB=S 

I  SN 

0023 

IF1F.LE.2)  RETURN 

I  SN 

0025 

X=0. 5* ( F- 1 . 0 ) 

I  SN 

0026 

IFIFVENI  Z=1.0 

I  SN 

0028 

IF! .NOT. EVEN)  Z=0.5 

I  SN 

0030 

IF  1 .NOT. BIGX I  GO  TO  2 

I  SN 

0032 

IF(EVEN)  F=0. 

I  SN 

0034 

IF! .NOT. EVEN)  E=0. 5723649 

I  SN 

0036 

C- ALOG ( A ) 

i  SN 

003  7 

l 

E=ALOG( Z l+E 

I  SN 

0038 

S=FXP(C*Z-A-E)FS 

I  SN 

0039 

Z=Z*1 .0 

I  SN 

0040 

IFIZ.LE.X)  GO  TO  1 

I  SN 

0042 

CH 1 PRB- S 

DAT  F  69.171/12.59.55 
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ISN  0043 

RETURN 

ISN  0044 

2 

IFIEVENI  E=1.0 

ISN  0046 

IF (.NOT. EVEN)  E=0. 5641 896 /SORT) A) 

ISN  0048 

C  =  0. 

ISN  0049 

3 

E=E*A/Z 

ISN  0050 

C  =  C*E 

ISN  0051 

Z=Z*1.0 

ISN  0052 

IF(Z.LE.X)  GO  TO  3 

ISN  0054 

CHIPRB=C*Y+S 

ISN  0055 

RETURN 

ISN  0056 

END 

******  f Mf)  OF  COMPILATION  ****** 


0  OF  A  JOB  STATISTICS  —  320  CAKOS  READ 


359  LINES  PRINTED 


0  CARDS  PUNCHED  —  0.55  MINUTES  EXECUTION  TIME 


